Google DeepMind Team
We introduce Gemini, a family of highly capable multimodal models, trained jointly across image, audio, video, and text data. Gemini models are highly efficient and achieve state-of-the-art performance on a wide range of tasks...
We introduce Llama 3, a new family of foundation language models ranging in scale from 8B to 70B parameters. We demonstrate that Llama 3 models are competitive with leading proprietary models on a wide range of benchmarks...
Sora is a text-to-video model that can generate videos up to a minute long while maintaining visual quality and adherence to the user's prompt. We investigate large-scale training of generative models on video data...