AI & LLM Engineering Internship 2026 at Micro Infotech | Work From Home | ₹5,000 Monthly Stipend
Are you passionate about Artificial Intelligence, Large Language Models (LLMs), and Generative AI? Micro Infotech has announced an exciting AI & LLM Engineering Internship for students and freshers across India. This virtual internship offers hands-on experience with cutting-edge AI technologies, including Groq, Cerebras, OpenRouter, Ollama, and vLLM.
Internship Overview
- Organization: Micro Infotech
- Position: AI & LLM Engineering Intern
- Internship Type: Virtual Internship (Work From Home)
- Location: Pan India
- Start Date: Immediately
- Duration: 3 Months
- Stipend: ₹5,000 per month
- Credits: 12
- Apply By: 25 June 2026
- Number of Openings: 10
About the Internship
This internship is designed for candidates interested in AI infrastructure, LLM optimization, and inference engineering. Interns will work on improving model performance, reducing latency, optimizing costs, and building scalable AI systems.
Selected candidates will be responsible for keeping every model call on the platform fast, reliable, and cost-efficient.
Key Responsibilities
Selected interns will:
- Benchmark AI inference providers such as Groq, Cerebras, Together AI, Fireworks, and OpenRouter.
- Compare latency, throughput, and accuracy across different AI models.
- Configure Ollama 0.3.x with speculative decoding for local fallback models.
- Build and maintain the ModelRouter system with automatic provider rotation.
- Monitor memory usage and token consumption for worker agents.
- Reduce average AI inference costs through optimization techniques.
- Create technical documentation related to inference optimization.
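To give applicants a feel for the ModelRouter responsibility above, here is a minimal sketch of automatic provider rotation. It is an illustration only and not Micro Infotech's actual implementation. The class layout, the `complete` method, and the stub providers `flaky_remote` and `local_fallback` are all assumptions made for this example. A real router would wrap HTTP clients for the providers named in the listing.

```python
import time
from typing import Callable, Dict, List, Tuple

# Hypothetical provider type: takes a prompt, returns text, raises on failure.
Provider = Callable[[str], str]


class ModelRouter:
    """Tries providers in order and rotates failed ones to the back."""

    def __init__(self, providers: List[Tuple[str, Provider]]):
        if not providers:
            raise ValueError("at least one provider is required")
        self.providers = list(providers)
        # Per-provider latency samples (seconds) for successful calls.
        self.latencies: Dict[str, List[float]] = {name: [] for name, _ in providers}

    def complete(self, prompt: str) -> Tuple[str, str]:
        """Return (provider_name, text) from the first provider that succeeds."""
        errors = []
        for i, (name, call) in enumerate(self.providers):
            start = time.perf_counter()
            try:
                text = call(prompt)
            except Exception as exc:  # any failure triggers failover
                errors.append(f"{name}: {exc}")
                continue
            self.latencies[name].append(time.perf_counter() - start)
            # Rotate so the working provider is tried first next time.
            self.providers = self.providers[i:] + self.providers[:i]
            return name, text
        raise RuntimeError("all providers failed: " + "; ".join(errors))


# Stub providers standing in for real API clients (assumed for illustration).
def flaky_remote(prompt: str) -> str:
    raise TimeoutError("simulated timeout")


def local_fallback(prompt: str) -> str:
    return f"echo: {prompt}"


router = ModelRouter([("remote", flaky_remote), ("local", local_fallback)])
name, text = router.complete("hello")
print(name, text)
```

The recorded latencies could feed the benchmarking task from the same list, for example by comparing median response times per provider. A production version would likely add retries, timeouts, and a cool-down before a failed provider is tried again.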
Required Skills
Applicants should have:
- Intermediate-level Python programming skills.
- Understanding of asynchronous programming concepts.
- Familiarity with REST APIs and JSON.
- Basic knowledge of Large Language Models (LLMs).
- Understanding of tokens, temperature settings, and context windows.
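For readers unsure what the last item covers, the sketch below illustrates two of the ideas in plain Python. The first is how temperature reshapes a model's next-token probabilities. The second is a simple check that a request fits inside a context window. The function names, the example logits, and the 4096-token window are assumptions chosen for illustration and are not tied to any specific model or provider.

```python
import math
from typing import List


def softmax_with_temperature(logits: List[float], temperature: float) -> List[float]:
    """Convert raw scores to probabilities; lower temperature sharpens them."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def fits_context(prompt_tokens: int, max_new_tokens: int, context_window: int) -> bool:
    """True if the prompt plus the requested output fits in the window."""
    return prompt_tokens + max_new_tokens <= context_window


logits = [2.0, 1.0, 0.0]  # made-up scores for three candidate tokens
cold = softmax_with_temperature(logits, 0.5)  # more deterministic
warm = softmax_with_temperature(logits, 2.0)  # more varied
print([round(p, 3) for p in cold])
print([round(p, 3) for p in warm])
print(fits_context(3000, 1000, 4096))
```

At the lower temperature the top-scoring token takes a larger share of the probability, which is why low settings give more repeatable output and high settings give more varied output.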
Preferred Skills
Candidates with experience in the following tools will have an advantage:
- Ollama
- LM Studio
- Local AI model runners
- Generative AI tools
Technologies You Will Learn
During the internship, candidates will gain exposure to:
- vLLM
- Cerebras WSE-3 API
- OpenRouter Free Routing
- Speculative Decoding
- Provider Failover Architecture
- AI Inference Optimization
- LLM Deployment Techniques
Internship Perks
Selected candidates will receive:
- Certificate of Completion
- Letter of Recommendation
- Learning Allowance
- Flexible Working Hours
- Hybrid Working Opportunities
- Pre-Placement Offer (PPO)
- Potential Job Offer
Who Can Apply?
Candidates who:
- Are from any educational background.
- Have relevant skills and interests in AI and Machine Learning.
- Are available for the complete 3-month internship duration.
- Can work full-time remotely.
Terms of Engagement
- Internship Type: Work From Home
- Timing: Full Time
- Mode: Virtual Internship
Why Apply?
This internship provides a valuable opportunity to work with modern AI technologies and gain practical experience in Large Language Models, AI infrastructure, and model optimization. It is especially beneficial for students and freshers looking to build a career in Artificial Intelligence, Machine Learning, NLP, and Generative AI.

I am Sonu Singh, a professional Blogger with 7 years of experience. I work across multiple websites, creating high-quality content and helping brands grow through SEO-driven blogging.