Gen AI Engineer
Actively Reviewing the ApplicationsViraaj HR Solutions Private Limited
India, Tamil Nadu, Chennai
Full-Time
On-site
INR 10–30 LPA
Posted 3 weeks ago
•
Apply by June 11, 2026
Job Description
A fast-scaling company in the Enterprise AI and Intelligent Automation sector building production-grade generative AI solutions for enterprise search, virtual assistants, and decision automation. We are hiring for the primary role: Generative AI Engineer. Location: India — On-site.
Role & Responsibilities
Must-Have
Skills: kubernetes,docker,python,pytorch
Role & Responsibilities
- Design and implement end-to-end generative AI solutions: data ingestion, model fine-tuning, RAG pipelines, and production inference.
- Fine-tune, evaluate, and benchmark LLMs using SFT/RLHF techniques; maintain reproducible experiment tracking and model versioning.
- Build retrieval-augmented generation architectures integrating vector stores and semantic search to improve accuracy and context grounding.
- Develop scalable inference services with optimization techniques (quantization, batching, model sharding) to meet latency and cost SLAs.
- Collaborate with MLOps and backend teams to create CI/CD, monitoring, alerting, and automated retraining pipelines for model governance.
- Drive technical best practices: code reviews, testable deployments, documentation, and mentor junior engineers on ML engineering standards.
Must-Have
- Python
- PyTorch
- Hugging Face Transformers
- LangChain
- Docker
- Kubernetes
- FAISS
- ONNX
- Triton Inference Server
- Proven experience deploying LLM-based features in production environments and ownership of lifecycle from training to monitoring.
- Familiarity with cloud platforms (AWS/GCP/Azure), GPU inference optimizations, and data privacy/compliance considerations for model deployment.
- Hands-on ownership of core AI products and opportunity to influence technical roadmap across product and infra domains.
- Collaborative, engineering-first culture with emphasis on learning, experimentation, and applied research to production ship.
- Competitive compensation, on-site collaboration, and career growth through mentoring and cross-functional exposure.
Skills: kubernetes,docker,python,pytorch
Required Skills
Engineering
Documentation
Automation
Compliance
Monitoring
Python
Cloud Platforms
Training
AWS
Research
Docker
Kubernetes
PyTorch
MLOps
Azure
Hugging Face Transformers
LangChain
CI/CD
Mentoring
Grounding
RAG
Intelligent automation
Governance
Hiring
Server
Data Privacy
Decision Automation
Data ingestion
Quantization
Vector
Generative
Semantic
Ingestion
Transformers
Optimization techniques
GPU
Semantic search
Fine-tuning
Privacy
Inference Server
Experimentation
Experiment
Retrieval-Augmented Generation
Inference
Model fine-tuning
Optimizations
Batching
Retrieval
LLMs
Model Deployment
Triton
Virtual
Generative AI
LLM
FAISS
RLHF
Triton Inference Server
Augmented Generation
Retrieval-augmented
Quick Tip
Customize your resume and cover letter to highlight relevant skills for this position to increase your chances of getting hired.
Related Similar Jobs
View All
Safety Executive - HSE
A.P. Moller - Maersk
India
Full-Time
₹3–4 LPA
Logistics
Training
Hiring
+16
Manager – Digital Engineering Product Lifecycle Management
PwC Acceleration Center India
India
Full-Time
Engineering
Data Analytics
Engineer - Transport Planning
WSP in India
India
Full-Time
Machine Learning
Engineering
Python
+1
Jr. Python Developer
PieFlowTech Solutions Private Limited
India
Full-Time
Git
MySQL
PostgreSQL
+4
Government Relations Coordinator
DB Engineering & Consulting
Communication
Engineering
Documentation
+20
Share
Quick Apply
Upload your resume to apply for this position