This Week I Learned - Week 8 2026
This Week I Learned -
* Brain Moore prompted nine different AI models to programmatically generate World Clocks to show how some of them struggle with the rendering. Every minute, a new clock is displayed. Each model is allowed 2000 tokens to generate its clock.
* India’s first sovereign LLM Sarvam AI's 30B-parameter model is pre-trained on 16 trillion tokens and supports a 32,000-token context length, enabling long conversations and agentic workflows while keeping inference costs low due to fewer activated parameters. It is a mixture-of-experts (MoE) model and has just 1 billion activated parameters, meaning that in generating every output token, it only activates 1B parameters.
* Alibaba Group's Qwen3.5-397B-A17B is a new open weight multimodal model built to be faster, cheaper, and more agent capable than its predecessors. It combines text, image, and video processing in a single architecture and uses a mixture of experts design where only 17 billion of its 397 billion parameters activate per query, helping reduce compute costs while keeping performance high.
* Claude Sonnet 4.6 is now available across all Claude plans, including Claude Cowork, Claude Code, the API, and major cloud platforms. Sonnet is designed for everyday coding and enterprise tasks, while Opus handles more complex reasoning and analytical work.
* Yann LeCun criticizes LLMs for lacking true reasoning, planning, persistent memory, and physical world grounding—they're good at language but not intelligent like humans or animals.
His bets: Joint Embedding Predictive Architecture (JEPA) for self-supervised learning of hierarchical abstractions; world models that simulate real-world dynamics for better prediction and action.
Progress till now (2026): LeCun founded AMI Labs to advance world models; JEPA prototypes show promise in video prediction. DeepMind and others are racing on similar tech, with early integrations into robotics.
Future trajectory: Hybrid systems combining LLMs with world models for AGI, scaling via more multimodal data.
Limitations: High compute needs, incomplete physical simulations, early-stage—far from human-level understanding.
Progress on JEPA/world models (2026): AMI Labs (LeCun's venture) raised €500M for hierarchical JEPA; prototypes excel in video prediction and action planning (e.g., arXiv 2601.00844). DeepMind's Genie 3 enables real-time 3D sims; integrations in robotics show promise.
Incomplete physical simulations: Models approximate Newtonian physics but falter on quantum effects, chaotic systems (e.g., turbulence), or multi-scale interactions due to compute limits and simplified abstractions—leading to errors in complex scenarios like fluid dynamics or material behaviors.
* According to Google DeepMind chief executive Demis Hassabis, AI could reach artificial general intelligence within five to eight years. At the India AI Impact Summit 2026 in New Delhi, he highlighted limitations of current AI systems like inconsistency, lack of creativity, and true general intelligence—echoing LeCun's critiques of LLMs.
* Google's AI Food tool, Food Mood is a playful fusion recipe generator that creates unique recipes inspired by multi country cuisine. Select the type of food (starter, soup, main course, or dessert), number of people to serve, and two countries for cuisine inspiration.
* ElevenLabs manages around 60,000 support calls for one client in English, Hindi and other Indian languages, and makes nearly 50,000 outbound calls a month for IDFC Bank.
* Sarvam Kaze is an AI-powered wearable spectacle can respond to inputs, interpret surroundings and record what users see. Custom experiences can be built for the device through the Sarvam platform.
* The AI-powered app, Shwaasa, uses an algorithm-based platform to screen patients for chronic obstructive pulmonary disease (COPD), one of India's major causes of illness. Last year, AllMS-Delhi conducted a validation study on 460 people at its Ballabhgarh unit. When compared with spirometry— the gold standard for lung function testing — the tool showed moderate overall correlation and strong agreement in severe cases.
Key hurdles in making spirometry widely available in India include high equipment and test costs, unaffordability for many patients, limited access in primary/rural care, shortage of trained technicians, time-consuming procedures, and challenges with patient cooperation and interpretation.
False positive rates in AI health apps vary widely (5-30%) based on the app, condition, and study. For Shwaasa, a Nature study shows 75% specificity, implying ~25% false positives—meaning 1 in 4 healthy users might get a false alert. Apps like this are for screening, not diagnosis; professionals should confirm results. Data collection often aids AI improvement but can benefit pharma companies—review privacy terms.
* A McKinsey & Company analysis estimates that by 2030, around 70 per cent of all new data centre capacity will cater to AI-heavy workloads.
Researchers at the University of California found that after generating a simple 100-word email, an AI chatbot’s cooling systems consume between half and three bottles of water (200–1,500 millilitres), depending on local conditions. Training these models requires exponentially more: Meta reportedly used 22 million litres of water to train its LLaMA-3 model.
As India races to power its AI revolution, massive data centres are being built in areas already struggling for water, raising fears that the country’s digital boom could deepen local water crises - Down To Earth
* "AI is a big growth booster for IT services, not a cost-cutter. Coforge now integrates AI in all new projects, shifting from trials to real-world delivery. Despite lower renewal rates (30-50% discounts), tech spending is rising, creating opportunities." - Sudhir Singh, Coforge
* Elon Musk - Flight code for the SpaceX rockets and Starlink satellites is written in C and C++.
Python is used where runtime performance is less important than rapid iteration and ease of use.
* A list of physical visualizations and related artifacts
* Hyderabad Urban Observatory from Hyderabad Urban Lab is envisaged as a set of projects around:
- data repositories, archives and libraries;
- research and advocacy;
- physical and social interventions around communities and places.

Comments
Post a Comment