Google I/O 2024: Gemini AI gets new capabilities to rival OpenAI's ChatGPT

Google I/O 2024: Gemini AI gets new capabilities to rival OpenAI's ChatGPT



Google I/O 2024 kicked off with a keynote address focused on Gemini, its artificial intelligence (AI) model that is set to get new capabilities to become the foundational model powering its services such as Search, Photos, Workspace, Android, and more. 


With Gemini, Google said, the goal is to make AI helpful for everyone. On that note, Google announced that it is expanding “AI overviews in Search” to everyone in US this week and to more countries soon. While this was long time coming, Google threw in a surprise with Gemini-powered “Ask Photos” feature for Google Photos. It essentially lets you search your entire library on Google Photos and follow-up the results with even more complex prompts. More details on the “Ask Photos” will be available later this year, which is when the feature is slated to roll out. 


About the Gemini itself, the model has been updated with new capabilities, said Google. Called Gemini 1.5 Pro, the new and improved version will be available to all developers globally. In addition, Google announced that Gemini 1.5 Pro with one-million context is now directly available for consumers in Gemini Advanced. This can be used across 35 languages. 


Here is a roundup of everything Google announced at I/O 2024 keynote:


Gemini in Workspace


Google said that it is rolling-out the Gemini 1.5 Pro model to its paid-tier customers with a new side-panel on Workspace apps such as Gmail, Drive, Docs, Sheets and more. The side-panel resembles the Microsoft’s Copilot side-panel on desktops and offers better accessibility to AI from any Workspace app. 


Another feature coming to Workspace is the new Gemini AI teammate, which is essentially an AI-powered assistant for Workspace apps. The Gemini Teammate has its own Google Account and can be incorporated into groups within Chats.


Google Project Astra


Google’s Project Astra is a multimodal AI agent with real-time spatial understanding. Google said that the AI agent is capable of understanding objects in a physical space and can process the data in real-time. It can basically watch and remember what it sees through your device’s camera and can respond to prompts based on it. Google said that the AI agent will be powering the company’s Gemini product starting later this year.


AI in Search


One of the biggest takeaways from Google’s announcement is AI in Search. The search engine will soon get the ability to analyse and search based on video inputs, similar to how it does with images using Google Lens. 


Google said that the Search is backed by a custom Gemini AI model and gets improved contextual understanding. Search results get AI-powered overviews, which were previously part of the Search Generative Experience (SGE) and was available as an experimental feature. Leveraging the Gemini AI, Google said, Search can break longer queries into smaller parts for better understanding as well.


Circle to Search


Circle to search for Android is set to get new features. Google said that the updated version of the feature will allow users to simply circle a mathematical problem and Google’s AI will provide with steps that should make it easy to solve the question. 


Smarter Gemini Assistant for Android


Google said that the Gemini AI assistant for Android will soon be able to harness multimodality by understand the video playing on the display and let users ask questions based on the video. The assistant will also gain the ability to answer the user’s query based on a document such as a PDF files. 


Gemini is also getting a new “Live feature” that will allow it to understand live videos in real-time and will be able to hold a more natural conversation with the user. 


Gems: Custom Gemini chatbots


Google said that it will soon allow Gemini Advanced subscribers to create custom chatbots for carrying out a specific task. The feature is similar to custom GPTs on OpenAI’s ChatGPT. 


Scam call detection on Android


Using the on-device Gemini Nano model, select Android powered smartphones will soon be able to detect if the phone call received is a scam call. Google said that the feature will understand the conversation pattern during the phone call and will notify the user if it thinks the on-going call is a scam call. According to Google, call data will be processed on-device for privacy and security. 


Google Veo


Google is set to rival OpenAI’s Sora with its new generative AI model called Veo, which the company said will be able to generate videos in 1080p resolution. The model will generate videos based on text, image, and video-based prompts and will allow users to further edit the generated video with more prompts.

First Published: May 15 2024 | 9:49 AM IST



Source link

OpenAI's co-founder Ilya Sutskever parts ways with ChatGPT maker Altman

OpenAI's co-founder Ilya Sutskever parts ways with ChatGPT maker Altman



OpenAI co-founder and chief scientist Ilya Sutskever is leaving the startup at the center of today’s artificial intelligence boom.

 


“OpenAI would not be what it is without him,” OpenAI CEO Sam Altman wrote in a message to the company, which OpenAI posted on its blog.

 


Microsoft-backed OpenAI makes the popular ChatGPT chatbot, which sparked a race among the world’s largest tech companies for dominance in the emerging generative AI field.

 


Jakub Pachocki will be the company’s new chief scientist, the company said on its blog.

 


Pachocki has previously served as OpenAI’s director of research and led the development of GPT-4 and OpenAI Five.


“After almost a decade, I have made the decision to leave OpenAI,” Sutskever said in a post on X.

 


Sutskever posted that he is working on a new project “that is very personally meaningful to me about which I will share details in due time.”

 


Sutskever played a key role in Altman’s dramatic firing and rehiring in November last year. At the time, Sutskever was on the board of OpenAI and helped to orchestrate Altman’s firing.

 


Days later, he reversed course, signing onto an employee letter demanding Altman’s return and expressing regret for his “participation in the board’s actions.”

 


After Altman returned, Sutskever was removed from the board and his position at the company became unclear.

 


Sutskever’s exit comes a day after the company said at an event on Monday that it would release a new AI model called GPT-4o, capable of realistic voice conversation and able to interact across texts and images.

 


Shortly after launching in late 2022, ChatGPT was called the fastest application ever to reach 100 million monthly active users. However, worldwide traffic to ChatGPT’s website has been on a roller-coaster ride in the past year and is only now returning to its May 2023 peak, according to analytics firm Similarweb.

 

Sutskever has long been a prominent researcher in the AI field. Before founding OpenAI, he worked as a researcher at Google Brain, and was a postdoctoral researcher at Stanford, according to his personal website. He started his career working with Geoffrey Hinton, one of the so-called “godfathers of AI”.


(Only the headline and picture of this report may have been reworked by the Business Standard staff; the rest of the content is auto-generated from a syndicated feed.)

First Published: May 15 2024 | 9:33 AM IST



Source link

YouTube blocks access to Hong Kong protest anthem videos after court order

YouTube blocks access to Hong Kong protest anthem videos after court order


The action is not a worldwide first for the US technology sector or Google parent Alphabet, which has restricted items when legally required to do so. In China, it has removed content | File image


YouTube has blocked access to videos of a protest song in Hong Kong, days after court approved an injunction banning the song in the city.


Glory to Hong Kong was an anthem of anti-government protests in 2019.


YouTube said that it would comply with a removal order and block access to over 32 YouTube videos of the song that were deemed to be prohibited publications under the injunction.


Attempts to access the YouTube videos from Hong Kong on Wednesday showed that they were unavailable. A message showed saying that This content is not available on this country domain due to a court order.


In approving the government’s application to ban the song, the court agreed it could be weaponised and used to incite secession.


We are disappointed by the court’s decision but are complying with its removal order by blocking access to the listed videos for viewers in Hong Kong, YouTube, which is owned by Alphabet Inc., said in an emailed statement.


We’ll continue to consider our options for an appeal, to promote access to information, the company said, adding that it shared the concerns of human rights organisations about the chilling effect the ban would have on free expression online.


Links to the 32 videos on YouTube will also not show up on Google Search for users in Hong Kong, according to YouTube.


George Chen, co-chair of digital practice at Asia Group, a Washington-headquartered business and policy consultancy, said it is worth watching how aggressively Hong Kong authorities will be in ordering internet platforms to remove the song.


Chen, who was the former head of public policy for Greater China at Meta, said that if the government begins sending platforms hundreds of links to remove every day, that would likely undermine investor confidence in Hong Kong.


That will hurt Hong Kong’s reputation as a leading financial centre because we know how important a free flow of data and information means to a financial centre, he said. So the government should be very careful and be aware of some unintended consequences that may impact its economic recovery and investors’ confidence.


Internet and social media platforms such as YouTube typically have policies for removal requests from governments.


Glory to Hong Kong was often sung by demonstrators during massive anti-government protests in 2019. The song was later mistakenly played as the city’s anthem at international sporting events, instead of China’s March of the Volunteers, in mix-ups that upset city officials.


Authorities earlier arrested some residents who played the song in public under other offences, such as playing a musical instrument in public without a permit, local media reported.


Critics have said prohibiting broadcast or distribution of the song further reduces freedom of expression since Beijing launched a crackdown in the former British colony following the 2019 protests. They have also warned the ban might disrupt the operation of tech giants and hurt the city’s appeal as a business centre.

First Published: May 15 2024 | 9:17 AM IST



Source link

Google introduces AI in search, raising hopes for better results for users

Google introduces AI in search, raising hopes for better results for users


This bold and responsible approach is fundamental to delivering on our mission and making AI more helpful for everyone, Google CEO Sundar Pichai told a group of reporters. (Photo: Bloomberg)


Google on Tuesday rolled out a retooled search engine that will frequently favor responses crafted by artificial intelligence over website links, a shift promising to quicken the quest for information while also potentially disrupting the flow of money-making internet traffic.


The makeover announced at Google’s annual developers conference will begin this week in the U.S. when hundreds of millions of people will start to periodically see conversational summaries generated by the company’s AI technology at the top of the search engine’s results page.


The AI overviews are supposed to only crop up when Google’s technology determines they will be the quickest and most effective way to satisfy a user’s curiosity a solution mostly likely to happen with complex subjects or when people are brainstorming, or planning. People will likely still see Google’s traditional website links and ads for simple searches for things like a store recommendation or weather forecasts.


Google began testing AI overviews with a small subset of selected users a year ago, but the company is now making it one of the staples in its search results in the U.S. before introducing the feature in other parts of the world. By the end of the year, Google expects the recurring AI overviews to be part of its search results for about 1 billion people.


Besides infusing more AI into its dominant search engine, Google also used the packed conference held at a Mountain View, California, amphitheater near its headquarters to showcase advances in a technology that is reshaping business and society.


The next AI steps included more sophisticated analysis powered by Gemini a technology unveiled five months ago and smarter assistants, or agents,” including a still-nascent version dubbed Astra” that will be able to understand, explain and remember things it is shown through a smartphone’s camera lens. Google underscored its commitment to AI by bringing in Demis Hassabis, the executive who oversees the technology, to appear on stage at its marquee conference for the first time.


The injection of more AI into Google’s search engine marks one of the most dramatic changes that the company has made in its foundation since its inception in the late 1990s. It’s a move that opens the door for more growth and innovation but also threatens to trigger a sea change in web surfing habits.


This bold and responsible approach is fundamental to delivering on our mission and making AI more helpful for everyone, Google CEO Sundar Pichai told a group of reporters.


It also will bring new risks to an internet ecosystem that depends heavily on digital advertising as its financial lifeblood.


Google stands to suffer if the AI overviews undercuts ads tied to its search engine a business that reeled in $175 billion in revenue last year alone. And website publishers ranging from major media outlets to entrepreneurs and startups that focus on more narrow subjects will be hurt if the AI overviews are so informative that they result in fewer clicks on the website links that will still appear lower on the results page.


Based on habits that emerged during the past year’s testing phase of Google’s AI overviews, about 25 per cent of the traffic could be negatively affected by the de-emphasis on website links, said Marc McCollum, chief innovation officer at Raptive, which helps about 5,000 website publishers make money from their content.


A decline in traffic of that magnitude could translate into billions of dollars in lost ad revenue, a devastating blow that would be delivered by a form of AI technology that culls information plucked from many of the websites that stand to lose revenue.

The relationship between Google and publishers has been pretty symbiotic, but enter AI, and what has essentially happened is the Big Tech companies have taken this creative content and used it to train their AI models, McCollum said. We are now seeing that being used for their own commercial purposes in what is effectively a transfer of wealth from small, independent businesses to Big Tech.”

But Google found the AI overviews resulted in people in conducting even more searches during the technology’s testing because they suddenly can ask questions that were too hard before, said Liz Reid, who oversees the company’s search operations, told The Associated Press during an interview. She declined to provide any specific numbers about link-clicking volume during the tests of AI overviews.


In reality, people do want to click to the web, even when they have an AI overview, Reid said. They start with the AI overview and then they want to dig in deeper. We will continue to innovate on the AI overview and also on how do we send the most useful traffic to the web.


The increasing use of AI technology to summarize information in chatbots such as Google’s Gemini and OpenAI’s ChatGPT during the past 18 months already has been raising legal questions about whether the companies behind the services are illegally pulling from copyrighted material to advance their services. It’s an allegation at the heart of a high-profile lawsuit that The New York Times filed late last year against OpenAI and its biggest backer, Microsoft.


Google’s AI overviews could provoke lawsuits too, especially if they siphon away traffic and ad sales from websites that believe the company is unfairly profiting from their content. But it’s a risk that the company had to take as the technology advances and is used in rival services such as ChatGPT and upstart search engines such as Perplexity, said Jim Yu, executive chairman of BrightEdge, which helps websites rank higher in Google’s search results.


This is definitely the next chapter in search, Yu said. It’s almost like they are tuning three major variables at once: the search quality, the flow of traffic in the ecosystem and then the monetization of that traffic. There hasn’t been a moment in search that is bigger than this for a long time.

(Only the headline and picture of this report may have been reworked by the Business Standard staff; the rest of the content is auto-generated from a syndicated feed.)

First Published: May 15 2024 | 7:06 AM IST



Source link

Google I/O: Alphabet unveils beefed-up AI chatbot as competition heats up

Google I/O: Alphabet unveils beefed-up AI chatbot as competition heats up


The Pro model – starting with prompt sizes of up to 1 million tokens, or pieces of data – will also be available to subscribers to Google’s Gemini Advanced service.


Google parent Alphabet on Tuesday showed how it is building on artificial intelligence across its businesses, including a beefed-up Gemini chatbot and improvements to search, as it faces growing competition from OpenAI and other rivals.


At its annual I/O developer event in Mountain View, California, CEO Sundar Pichai said the company is rolling out AI Overviews to all users in the U.S. this week after a long period of public testing since last year.

 


The new AI features unveiled on Tuesday will help investors evaluate Alphabet’s progress as it races against Microsoft , OpenAI and other competitors to dominate the emerging technology.

 


Shares of Alphabet climbed during the product presentation and were last up about 1% at $172.50 on Tuesday afternoon.


“We are in the very early days of AI platforms,” Pichai said.

 


Google announced improvements to its Gemini Pro 1.5 model that is capable of making sense of a massive amount of data. On Tuesday, Google said it was doubling that amount, to 2 million tokens, meaning the AI potentially could answer questions when given thousands of pages of text or more than an hour of video to ingest.

 


The Pro model – starting with prompt sizes of up to 1 million tokens, or pieces of data – will also be available to subscribers to Google’s Gemini Advanced service.

 


In another sign of fierce competition between OpenAI and Google, the online search leader teased Veo, an AI model that it claims to be its most powerful yet for creating videos on a simple text command.

 


Google had released an earlier video-generation technology in January, only to be upstaged weeks later by OpenAI’s Sora.


The ChatGPT maker has promoted its film-conjuring software among Hollywood executives, enthralling and worrying the creative industry.

 


Google said that filmmaker Donald Glover has experimented with its AI. The company also previewed a new text-to-image model, Imagen 3, and it touted other artist collaborations.

 


The company announced a scaled-down version of Gemini called 1.5 Flash, which aims to lower the cost of deploying AI and speed up responses. Like the more capable version, Flash can take in large amounts of data while being optimized for chat applications, video and image captioning.

 


AI Overviews uses generative AI to synthesize information and answer more complex queries for which there is no simple answer on the Web.

 


Alphabet’s AI unit, Google DeepMind, has worked to build technology that can carry out day-to-day tasks for consumers.


Early results have manifested in Project Astra, a tool that can use a smartphone camera and draw conclusions about the world around it.

 


In a demo video shown during Google I/O, a user deployed it to identify a speaker and locate glasses they had left in another part of the room.

 


Microsoft-backed OpenAI on Monday showcased a new AI model called GPT-4o, which enables ChatGPT to respond via voice in real time and be interrupted – both hallmarks of realistic voice conversations that AI voice assistants like Google Assistant have found challenging.

First Published: May 14 2024 | 11:45 PM IST



Source link

Adobe Experience Platform apps to be available in India by year end

Adobe Experience Platform apps to be available in India by year end


India is among the company’s fastest-growing markets, with customers, including Air India, ICICI Bank, HDFC Bank, Bajaj Allianz, Tata Motors, and MakeMyTrip.


Adobe Experience Platform-based applications will be available for enterprise customers in the country via a data centre by the end of the calendar year, the company said on Tuesday.


The move will help Adobe Experience Platform-based applications’ users to store their data locally as well as reduce latency in accessing them, it said.


“Adobe Experience Platform-based applications will be available for enterprise customers via an India data centre later in the year. This will deliver on local data residency requirements and improve performance through lower latency,” the company said in a statement.


“Generative AI is driving a foundational shift in the relationship between brands and their customers in India, marking this as the era for businesses to drive profitable growth while delivering new digital experiences,” Adobe India Vice-President and Managing Director Prativa Mohapatra said.


The company said it has seen an increase in demand for Adobe Experience Platform-based applications from customers across banking financial services and insurance, telecom, manufacturing, and retail segments.


India is among the company’s fastest-growing markets, with customers, including Air India, ICICI Bank, HDFC Bank, Bajaj Allianz, Tata Motors, and MakeMyTrip.


“We are excited to meet their hyper-growth requirements with the availability of Adobe Experience Platform-based applications,hosted via an India-based data centre,” Mohapatra said.

(Only the headline and picture of this report may have been reworked by the Business Standard staff; the rest of the content is auto-generated from a syndicated feed.)

First Published: May 14 2024 | 11:35 PM IST



Source link

YouTube
Instagram
WhatsApp