Company Overview:
Datakrew is revolutionizing EV fleet intelligence with cutting-edge IoT/AI solutions. Our flagship solution, the OXRED Platform Suite, provides deep insights into vehicle fleet performance and diagnostics. Datakrew is backed by leading global investors, including Greenwillow Capital, BEENEXT, 500 Global, AngelList, SEEDS (SG Growth Capital), XA Network, Cloud Capital, and Lighthouse Canton. We have an active customer footprint in 7 countries. Our goal is to serve one million EVs within the next 5 years and, as a company, to touch a billion lives with technology.
Job Overview:
We are looking for an ML Intern – LLM & GenAI to join our team. This role is ideal for someone eager to work on cutting-edge conversational AI and contribute to the development of AskOX, the conversational AI layer of the OXRED platform that lets users explore fleet data through simple natural-language queries. The position focuses on building and enhancing the LLM backend that powers this system.
Key Responsibilities:
LLM and Backend Development
- Build the LLM-based backend logic (FastAPI preferred).
- Implement retrieval-augmented generation (RAG) to fetch structured and unstructured data from OXRED.
- Use frameworks like LangChain, LlamaIndex, or Haystack to manage context retrieval, query routing, and summarization.
- Develop prompt templates, intent classifiers, and structured query generators.
Integration and Testing
- Integrate the chatbot logic with the existing OXRED AskOX frontend.
- Define and test different query types (e.g., fleet-level summaries, vehicle-level drilldowns, performance metrics).
- Ensure secure and efficient data flow between OXRED APIs and the AskOX backend.
Optimization and Reporting
- Evaluate and improve retrieval accuracy, latency, and hallucination rates.
- Implement caching or schema-based memory for frequently accessed data.
- Provide sample test cases and response evaluation reports.
Documentation and Handoff
- Deliver modular, production-ready code and API documentation.
- Include retraining or model-upgrade guidelines.
- Provide sample conversation flows and schema mappings.
Requirements:
- Strong experience with LLMs, LangChain, or LlamaIndex.
- Proficiency in Python and FastAPI.
- Experience with retrieval pipelines, SQL, or CrateDB/PostgreSQL.
- Understanding of vector databases (e.g., FAISS, Chroma, Pinecone).
- Ability to design efficient prompt and retrieval logic for structured data queries.
- Knowledge of the EV domain (a plus).
Preferred Qualifications:
- Knowledge of EV analytics, fleet management, or IoT data.
- Familiarity with OpenAI, Anthropic, or Ollama models.
- Experience with embedding optimization and hallucination control.
- Prior exposure to multi-agent or AI orchestration frameworks.
How to Apply:
Interested candidates should send their resume and a cover letter, detailing their relevant experience and explaining why they are a good fit for this position, to hr@datakrew.com with the subject line "ML Intern - LLM & GenAI – Application – [Your Name]".
Datakrew is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.