Agents: All You Need (AYNA)
Your guide to the evolving world of LLMs and AI agents
Opinion AI:
AI — Five predictions for 2025
Transforming Tomorrow: Key Shifts in AI Landscape
This year, we’ve achieved so much together. As I look to the future, I’m inspired by the pace at which the field is moving, and this is my small effort to predict the unpredictable. I received a lot of good feedback this year, and I am grateful for it. It was just this year that Nvidia popped off to become a $2 trillion company. This year I also launched another newsletter, Agents: All You Need. Our motto is to distribute knowledge to the new generation. As 2024 sails into the sunset, let’s welcome 2025. Thank you for following me on LinkedIn and X. I recommend subscribing here.
Pika Labs has released version 2.0 of its AI video generator with a major new feature that allows users to add their own images to AI-generated videos. The company calls this feature "Scene Ingredients."
Microsoft debuts Phi-4, a new generative AI model, in research preview
DeepSeek announces an open-source model that rivals advanced closed-source models such as GPT-4o and Claude 3.5.
ChatGPT’s AI search engine is rolling out to everyone - OpenAI's ChatGPT search engine, now available to all users, includes an optimized mobile version with advanced voice mode and features resembling traditional search engines, such as location-based results with images and maps
Meta AI Releases Apollo: A New Family of Video-LMMs (Large Multimodal Models) for Video Understanding - Apollo models introduce innovative techniques like fps sampling and dual vision encoders to enhance video understanding, achieving strong performance across video-language tasks while offering scalable solutions for real-world applications.
Meta AI Introduces Byte Latent Transformer (BLT): A Tokenizer-Free Model That Scales Efficiently - Meta AI's Byte Latent Transformer (BLT) eliminates tokenization by processing raw byte sequences into dynamic patches, improving efficiency, scalability, and robustness in language models compared to traditional tokenization-based architectures.
OpenAI announces a ChatGPT organizing system called Projects - OpenAI's new Projects feature in ChatGPT enhances user experience by allowing customization and organization of chats, integrating capabilities like Canvas support and web connection for tasks such as project management and personal website creation.
OpenAI cofounder Ilya Sutskever says the way AI is built is about to change - Ilya Sutskever predicts a shift in AI development due to the finite nature of data, leading to future AI systems that are more autonomous and capable of reasoning beyond current pattern-matching methods.
Google DeepMind releases FACTS Grounding benchmark for evaluating LLMs’ grounding and factuality in real-world inputs
GitHub announces Copilot access for free, offering GPT-4o and Claude 3.5 for VS Code users.
Google DeepMind unveils a new video model to rival Sora
Automated proposal generation:
Given one of seven topics (bias, coding, safety, multilinguality, factuality, math, or uncertainty) and 10 related papers found by the Semantic Scholar search engine, Claude 3.5 Sonnet generated 4,000 research ideas. The authors embedded the ideas using all-MiniLM-L6-v2 and removed duplicate ideas based on the cosine similarity of their embeddings. This left roughly 200 AI-generated ideas for each topic. For each remaining idea, the model generated a proposal.
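For readers who want a feel for the deduplication step, here is a minimal sketch of filtering near-duplicate ideas by the cosine similarity of their embeddings. It is an illustration under stated assumptions, not the authors' code: the function names, the toy 3-dimensional vectors, and the 0.8 threshold are all hypothetical, since the summary above does not state the cutoff. In a real pipeline the vectors would come from an embedding model such as all-MiniLM-L6-v2.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def dedupe_by_cosine(ideas, embeddings, threshold=0.8):
    """Greedy filter: keep an idea only if it is not too similar
    to any idea already kept. The threshold is an assumed value."""
    kept_ideas = []
    kept_vectors = []
    for idea, vec in zip(ideas, embeddings):
        if all(cosine_similarity(vec, kv) < threshold for kv in kept_vectors):
            kept_ideas.append(idea)
            kept_vectors.append(vec)
    return kept_ideas


# Toy data: hand-written vectors standing in for real sentence embeddings.
ideas = [
    "Calibrate uncertainty with self-consistency checks",
    "Use self-consistency checks to calibrate uncertainty",
    "Reduce bias by prompting in multiple languages",
]
toy_embeddings = [
    [1.0, 0.0, 0.0],
    [0.98, 0.05, 0.0],  # nearly parallel to the first vector
    [0.0, 1.0, 0.0],    # orthogonal to the first vector
]

unique_ideas = dedupe_by_cosine(ideas, toy_embeddings)
print(unique_ideas)
```

The greedy pass keeps the first idea in each cluster of near-duplicates, so the result depends on input order. That is a simplification chosen for brevity here.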
Fei-Fei Li's startup turns photos into 3D worlds
Stanford professor Fei-Fei Li, dubbed the "godmother" of AI, on Monday released an early peek at what her AI startup has been working on: an engine that can turn still images into realistic three-dimensional worlds.
Upcoming Events
Jan. 14
Stanford Digital Economy Lab
9:00 – 10:00 am PT • Webinar