Job Title: AI / Machine Learning Engineer
Location: Remote (India)
Employment Type: Full-Time
Reporting To: Head of Tech & R&D
The Opportunity
AssistifAI Inc. is a Canadian-registered technology company building the next generation of unified AI workforces. We are replacing legacy, fragmented customer service infrastructure with native Voice Assistants, automated Telecalling, intelligent Scheduling, and our proprietary 'Keeper' cross-channel conversation monitoring layer.
We are looking for a rigorous, mathematically driven AI/ML Engineer to join our core technical team in India. Working directly alongside our Head of Tech & R&D, you will not just be making API calls; you will be architecting, fine-tuning, and deploying the low-latency machine learning models that power our real-time media pipelines.
Key Responsibilities
- Model Deployment & Optimization: Design, train, and deploy generative AI models to handle real-time conversational tasks, ensuring ultra-low latency for native voice and telecalling applications.
- Pipeline Engineering: Build and maintain scalable ML infrastructure that integrates seamlessly with our 'Keeper' observability layer to monitor conversation quality and hallucination rates in real-time.
- LLM Orchestration: Implement advanced prompting strategies, RAG (Retrieval-Augmented Generation) architectures, and fine-tuning of open-source models (Llama, Mistral, etc.) to optimize for cost and accuracy.
- Audio & NLP Processing: Work on the intersection of Speech-to-Text (STT), Text-to-Speech (TTS), and Natural Language Understanding to make our voice assistants indistinguishable from human operators.
- Continuous R&D: Stay at the bleeding edge of the AI landscape, rapidly prototyping new research papers and evaluating emerging models for production viability.
Who You Are
- The Architect: You have 3+ years of rigorous software engineering and machine learning experience, with a deep understanding of Python, PyTorch, or TensorFlow.
- The LLM Native: You have hands-on experience moving LLMs from experimental notebooks into high-traffic, production environments.
- The Optimizer: You obsess over milliseconds. You understand how to optimize inference speeds, manage token streaming, and reduce computational overhead.
- The Builder: You thrive in an agile, zero-red-tape startup environment. You take extreme ownership of your code from local testing to global deployment.
Technical Requirements
| Category |
Skills & Stack |
| Core Languages |
Python (Expert), C++ (Bonus for performance optimization), SQL. |
| AI / ML Frameworks |
PyTorch, TensorFlow, LangChain, LlamaIndex, Hugging Face. |
| Cloud & MLOps |
AWS / GCP / Azure, Docker, Kubernetes, CI/CD pipelines. |
| Domain Expertise |
NLP, Conversational AI, RAG architectures, Audio/Speech processing (WebRTC, STT/TTS). |
β
Why Join Us?
- Global Impact: Build systems that are aggressively disrupting the North American and European enterprise markets.
- Technical Mentorship: Work directly under high-level R&D leadership with a clear mandate to innovate.
- Culture of Autonomy: Remote-first culture that values deep, uninterrupted technical work over meetings.
- Competitive Package: Attractive base salary aligned with top Indian tech standards, plus potential for early-stage equity/stock options.