🚀 Hiring: Data Engineer – Multimodal AI
AI4Bhārat is building the next generation of large-scale multimodal AI systems across Speech, NLP, and Vision. We are looking for exceptional Data Engineers (full-time + interns) to join our team in Chennai and help us push the frontier of multimodal AI.
The Role:
As a Data Engineer (Multimodal), you will be at the heart of our mission: building the data backbone for Indian Language AI, the foundation that powers frontier multimodal models.
This is not a standard ETL role. You’ll design massive-scale data pipelines, integrate state-of-the-art multimodal AI models spanning speech, vision, and language directly into those workflows, and optimize workloads across hundreds of GPUs. Your work will directly enable the training of models that set new benchmarks for Indian and global AI.
What you’ll do:
- Architect and scale data pipelines for multimodal corpora (speech, vision, text) at petabyte scale.
- Integrate AI models in-the-loop for data cleaning, filtering, and enrichment.
- Build and optimize training and benchmarking pipelines for cutting-edge AI models.
- Work with modern dataset formats (WebDataset, Arrow/Parquet, HF Datasets) and design sharding strategies that maximize training throughput.
- Evaluate and stress-test AI systems across languages, modalities, and domains.
- Work on real-world, high-impact AI challenges for Indian languages.
What we’re looking for:
- Proficiency in Python & PyTorch, with strong fundamentals in deep learning, data engineering, and distributed systems.
- Hands-on with Linux, Bash, multiprocessing/SLURM, and modern data tooling (HF Datasets, PyArrow, WebDataset, Polars).
- Good fundamentals in algorithms, systems, and databases, plus hands-on with Git, Docker, and cloud/HPC environments.
- Freshers and engineers with 1–3 years of experience welcome.
- Degree in CS, DS, or related fields (B.Tech / M.Tech / Master’s).
Bonus points if you have:
- Experience integrating LLMs into production workflows (e.g., NeMo, HuggingFace, or custom LLM APIs in pipelines).
- Hands-on exposure to large-scale, high-throughput data pipelines or multilingual AI challenges (speech, text, and vision).
- An understanding of distributed training frameworks (e.g., DeepSpeed, FSDP) and GPU cluster orchestration.
- A strong track record of open-source contributions, or of building tools and libraries that others use at scale.
Why join us:
- Work on impact-driven AI for Indian languages.
- Get access to large-scale GPU clusters to run bold experiments.
- Join a friendly, high-energy team that thrives on solving hard problems.
📍 Location: Chennai (on-site preferred, hybrid possible)
📅 Deadline: Apply before 5th September; applications will be reviewed on a rolling basis.
🚀 Start Date: Immediate (flexible for the right candidate)
📝 Apply here: https://lnkd.in/gJVafUiM
Mitesh Khapra · Anoop Kunchukuttan · S V Praveen · Kaushal Bhogale