🗞️ Why in News India is bidding to attract over $200 billion in AI infrastructure investment by 2028, positioning itself as a global AI hub. However, a Carnegie Endowment analysis identifies three critical deficits — talent, data, and R&D — that could undermine these ambitions.

The Editorial Argument

The Indian Express editorial argues that India’s AI mission focuses too narrowly on compute infrastructure (GPU clusters, data centres) while neglecting the three foundations that actually determine AI capability: a deep talent pipeline, high-quality training data, and indigenous R&D capacity. Without addressing these deficits, India risks becoming a low-cost data centre host rather than an AI innovator.

India’s AI Ambitions

Initiative Detail
IndiaAI Mission Rs 10,372 crore ($1.25 billion); approved March 2024
Compute target 10,000 GPUs in government-funded AI compute facility
AI investment target $200 billion by 2028 (per government estimates)
MeitY AI centres 7 Centres of Excellence in AI across IITs and IISc
AIRAWAT AI Research, Analytics and Knowledge Assimilation Technology platform

Deficit 1: Talent

India produces ~1.5 million engineering graduates annually but faces a severe shortage of AI-specific talent:

Metric Data
Engineering graduates/year ~1.5 million
AI/ML specialists (estimated) ~50,000-80,000
Global AI talent concentration US (40%), China (11%), UK (7%), India (~5%)
AI PhDs (India, annual) ~500-700
AI PhDs (US, annual) ~3,000-4,000
Brain drain factor ~40% of IIT AI graduates move abroad for higher studies/jobs

The deficit is not in basic coding skills but in deep AI research capability — the ability to build new architectures, develop novel training methods, and push the frontier of AI science.

Deficit 2: Data

AI models are only as good as their training data. India’s data ecosystem has critical gaps:

  • Language data: India has 22 scheduled languages and 780+ spoken languages. High-quality datasets exist for Hindi and English but are sparse for most Indian languages
  • Healthcare data: India has no national health data infrastructure comparable to the UK’s NHS Digital or the US’s NIH databases. ABDM (Ayushman Bharat Digital Mission) is building this but coverage remains low
  • Agricultural data: Despite 140 million farming households, digitised crop data, soil health records, and yield predictions remain fragmented
  • Data quality: Much available data is noisy, biased, or poorly labelled — a problem that compute power alone cannot solve

The Bhashini platform (National Language Translation Mission) is a positive step for multilingual AI, but it covers only translation — not the broader training data needed for domain-specific AI applications.

Deficit 3: R&D

India’s AI spending is heavily skewed toward deployment (using existing models) rather than development (creating new ones):

Metric India US China
AI R&D spend (% of GDP) ~0.03% ~0.3% ~0.2%
Foundation model labs 0 major OpenAI, Google, Anthropic, Meta Baidu, Alibaba, ByteDance
AI patents (2023) ~5,000 ~45,000 ~65,000
AI research papers (2023) ~15,000 ~30,000 ~50,000

India uses AI models built elsewhere. It does not yet have the R&D infrastructure to build frontier models — the large language models and multimodal systems that define the current AI revolution.

Policy Recommendations

The editorial proposes:

  1. AI Faculty Mission: Recruit 500 AI researchers globally for Indian universities — offer competitive salaries, research grants, and dual-appointment flexibility
  2. National AI Data Commons: Create curated, labelled datasets for Indian languages, healthcare, agriculture, and governance — publicly available for research
  3. AI R&D Fund: Dedicate Rs 5,000 crore ($600 million) exclusively for frontier AI research — not deployment, not compute, but fundamental science
  4. Retain talent: Fast-track AI PhD programmes at IITs with industry-matched stipends (Rs 1 lakh+/month) to compete with foreign offers
  5. Open-source focus: Support Indian contributions to open-source AI frameworks rather than licensing proprietary models

SAHI Framework Connection

The SAHI (Strategy for AI in Healthcare for India) framework launched by MoHFW represents the deployment model — using AI for cancer screening, maternal health, and disease surveillance. But deployment without indigenous R&D means permanent dependence on foreign AI labs for model updates, bias corrections, and capability extensions.

UPSC Relevance

Prelims: IndiaAI Mission budget, AIRAWAT, Bhashini platform, MeitY AI centres, AI patent numbers

Mains GS-3: AI as technology — indigenisation, R&D investment, intellectual property; India’s digital economy; science and technology policy

📌 Facts Corner — Knowledgepedia

IndiaAI Mission:

  • Budget: Rs 10,372 crore (~$1.25 billion); approved March 2024
  • Compute: 10,000 GPUs target for government AI facility
  • MeitY: 7 Centres of Excellence in AI (IITs, IISc)
  • AIRAWAT: AI Research, Analytics and Knowledge Assimilation Technology platform

India’s AI Talent:

  • Engineering graduates: ~1.5 million/year
  • AI/ML specialists: ~50,000-80,000
  • AI PhDs (annual): ~500-700 (vs US: ~3,000-4,000)
  • Brain drain: ~40% of IIT AI graduates move abroad

India’s AI Data Initiatives:

  • Bhashini: National Language Translation Mission (multilingual AI)
  • ABDM: Ayushman Bharat Digital Mission (health data infrastructure)
  • India Data Management Office (IDMO): Under MeitY
  • Digital India Act (proposed): Data governance framework

Other Relevant Facts:

  • Global AI market (2025): ~$200 billion; projected $1 trillion by 2030
  • India’s AI R&D spend: ~0.03% of GDP (vs US: ~0.3%, China: ~0.2%)
  • Carnegie Endowment: “Missing Pieces in India’s AI Puzzle” (2025 report)
  • NASSCOM estimate: India’s AI market to reach $17 billion by 2027
  • Top AI companies in India: TCS, Infosys, Wipro (services); Krutrim, Sarvam AI (startups)

Sources: Indian Express, Carnegie Endowment, MeitY