We are looking for a highly skilled and motivated AI/ML Engineer with 2–3 years of experience to join our growing team. In this role, you will work on the development, optimization, and deployment of intelligent AI/ML models with a strong focus on NLP and LLM-based systems. You’ll collaborate across engineering, product, and data teams to build AI-powered features into scalable web and enterprise platforms.
Key Responsibilities
• Design, develop, and deploy machine learning and NLP models for use cases like classification, summarization, entity extraction, recommendation, and document intelligence
• Build and maintain Multi-Component Pipelines (MCPs) involving LLMs, embedding generation, and vector search
• Work with Large Language Models (LLMs) and integrate with LangChain and Agentic architectures to build context-aware, task-specific agents
• Implement Retrieval-Augmented Generation (RAG) pipelines using vector databases including Pinecone and ChromaDB
• Integrate AI models into scalable backend systems using Django, FastAPI, or Flask, and deploy APIs for intelligent feature consumption
• Use tools like LangChain, Hugging Face Transformers, and LlamaIndex to support modular and reusable NLP workflows
• Collaborate with cross-functional engineering teams to embed intelligent agents and LLMs into web platforms and applications
• Conduct model evaluation, A/B testing, optimization, and hyperparameter tuning to ensure model performance and reliability in production
• Manage deployments on cloud platforms such as AWS, GCP, or Vertex AI with Docker, Git, and CI/CD practices
• Keep up with research and trends in generative AI, vector search, multi-agent coordination, and privacy-aware data handling
• Work with relational and non-relational databases (e.g., PostgreSQL, MySQL, MongoDB) to support data-driven model training, storage, and real-time inference.