W

Technical Program Manager

While technology
Work From Home
United States
Manager

While technology is the heart of our business, a global and diverse culture is the heart of our success. We love our people and we take pride in catering them to a culture built on transparency, diversity, integrity, learning and growth.


If working in an environment that encourages you to innovate and excel, not just in professional but personal life, interests you- you would enjoy your career with Quantiphi!

About Quantiphi:

Quantiphi is an award-winning Applied AI and Big Data software and services company, driven by a deep desire to solve transformational problems at the heart of businesses. Our signature approach combines groundbreaking machine learning research with disciplined cloud and data-engineering practices to create breakthrough impact at unprecedented speed.

Company Highlights:

Quantiphi has seen 2.5x growth YoY since its inception in 2013, we don’t just innovate - we lead. Headquartered in Boston, with 4,000+ Quantiphi professionals across the globe. As an Elite/Premier Partner for Google Cloud, AWS, NVIDIA, Snowflake, and others, we’ve been recognized with:

  • 17x Google Cloud Partner of the Year awards in the last 8 years.
  • 3x AWS AI/ML award wins.
  • 3x NVIDIA Partner of the Year titles.
  • 2x Snowflake Partner of the Year awards.
  • We have also garnered top analyst recognitions from Gartner, ISG, and Everest Group.
  • We offer first-in-class industry solutions across Healthcare, Financial Services, Consumer Goods, Manufacturing, and more, powered by cutting-edge Generative AI and Agentic AI accelerators.
  • We have been certified as a Great Place to Work for the third year in a row- 2021, 2022, 2023.

Be part of a trailblazing team that’s shaping the future of AI, ML, and cloud innovation. Your next big opportunity starts here!

Work Location: Bedminster, NJ or Dallas, TX

Responsibilities:

  • Lead AI/ML program execution, ensuring timely delivery of scalable, production-grade RAG/LLM/Agentic solutions.
  • Define program roadmaps through PI planning sessions, milestones, and deliverables for AI-driven initiatives across multiple teams.
  • Manage LLM infrastructure, GPU optimization, AI inferencing pipelines, and large-scale model deployment strategies.
  • Oversee the implementation of RAG, Agentic Workflows, multi-agent LLM systems, and Retrieval-augmented QA pipelines.
  • Managing client engagement and delivery per terms of the contract expectations.
  • Manage project delivery, team and ensure positive customer relations.
  • Drive project margins optimization using Gen AI based tools, accelerators.
  • Collaborate with our diverse and global teams to deliver committed results to our clients.
  • Lead AI-driven engagements, ensuring alignment with business goals, technical feasibility, and governance frameworks.
  • Develop and execute strategic roadmaps for LLM-based solutions, including RAG (Retrieval-Augmented Generation), Agentic RAG, and Agent-driven workflows.
  • Manage cross-functional teams, including ML engineers, data scientists, software developers, and consultants to deliver AI solutions.
  • Collaborate with stakeholders to define technical architecture, infrastructure requirements, and optimization techniques.
  • Implement scalable AI agent architectures, ensuring integration with LangChain, NVIDIA NeMo, and Triton Inference Server.
  • Track project performance, set KPIs, and provide executive-level reporting on outcomes and ROI.
  • Guide AI model evaluation, MLOps pipeline integration, and fine-tuning strategies for scalable AI solutions.
  • Support AI compliance strategies, ensuring alignment with data privacy, security, and responsible AI practices.

Skill Set Required:

  • More than 8 years of program management experience.
  • Strong leadership and multi-stakeholder management skills.
  • Multi-Workstream Project Management ensuring customer success & account growth.
  • Maintaining positive work environment & ensure career growth of the team members.
  • Tight Delivery execution and reporting to senior management at client organization and at Quantiphi.
  • Mentoring team members for career progression & upskilling to drive better solution outcomes.
  • Team leading experience and ability & experience to work as project lead.
  • Excellent Communication, presentation & storytelling skills.
  • Must have experience with Cloud GCP or AWS or Azure (LLM hosting, GPU-based inference, cost optimization).
  • Experience managing large-scale AI projects leveraging LLMs (e.g., Llama, GPT, Claude, Mistral).
  • Strong expertise in RAG, Agentic RAG, AI Agents, Vector DBs (e.g., FAISS, Pinecone, Weaviate, ChromaDB).
  • Knowledge of LLM-based fine-tuning techniques, Low-Rank Adaptation (LoRA), Quantization (AWQ, GPTQ, FP8, INT4).
  • Familiarity with Multi-GPU parallelization, model pruning, and knowledge distillation.
  • Understanding of Governance frameworks (e.g., AI Ethics, Explainability, Risk Mitigation).
  • Proficiency in NVIDIA NeMo, Triton Inference Server, and LangChain for agentic workflows.

What is in it for you:

  • Be part of a team and company that has won NVIDIA's AI Services Partner of the Year three times in a row with an unparalleled track record of building production AI applications on DGX and Cloud GPUs.
  • Strong peer learning which will accelerate your learning curve across Applied AI, GPU Computing and other softer aspects such as technical communication.
  • Exposure to working with highly experienced AI leaders at Fortune 500 companies and innovative market disruptors looking to transform their business with Generative AI.
  • Access to state-of-the-art GPU infrastructure on the cloud and on-premise.
  • Be part of the fastest-growing AI-first digital transformation and engineering company in the world

If you like wild growth and working with happy, enthusiastic over-achievers, you'll enjoy your career with us!

Apply Now