Job description
Context
Professionals in the medical and pharmaceutical industries must navigate vast and complex regulatory corpora (EU directives, ISO standards, FDA guidelines, etc.). Searching these documents for precise information is time-consuming and inefficient. The goal of this internship is to design and develop an intelligent conversational system that facilitates regulatory information retrieval.
Main objectives
Develop a chatbot system capable of:
- Indexing and structuring a database of regulatory texts.
- Understanding users’ natural language queries.
- Extracting and summarizing relevant information.
- Providing contextual answers with precise references to sources.
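To give candidates a concrete picture of the index, query, and cite loop described above, here is a minimal sketch in Python. It is an illustration only, not the system to be built: the corpus entries and their source labels are hypothetical, and plain word-count vectors with cosine similarity stand in for the learned embeddings and vector database a real solution would use.

```python
import math
import re
from collections import Counter

# Hypothetical passages with made-up source labels, for illustration only.
CORPUS = [
    {"source": "Doc A, section 1",
     "text": "Manufacturers must keep technical documentation for each medical device."},
    {"source": "Doc B, section 4",
     "text": "Clinical trial sponsors must report serious adverse events promptly."},
    {"source": "Doc C, section 2",
     "text": "Labels must state the storage conditions of the pharmaceutical product."},
]


def vectorize(text):
    """Turn text into a bag-of-words count vector (a stand-in for an embedding)."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Indexing step: vectorize every passage once, up front.
INDEX = [(doc, vectorize(doc["text"])) for doc in CORPUS]


def answer(query, top_k=1):
    """Return the top_k passages most similar to the query, with their sources."""
    query_vector = vectorize(query)
    ranked = sorted(INDEX, key=lambda item: cosine(query_vector, item[1]), reverse=True)
    return [{"source": doc["source"], "passage": doc["text"]} for doc, _ in ranked[:top_k]]


if __name__ == "__main__":
    for hit in answer("What documentation must manufacturers keep?"):
        print(hit["source"], "|", hit["passage"])
```

In the actual project, the retrieved passages would typically be passed to an LLM that writes the summarized answer, while the source labels are kept so that every answer can cite its references.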
Skills developed
- Advanced NLP: embeddings, semantic search, LLMs
- Data architecture: vector databases, indexing
- Full-stack development: REST API, user interface
- AI evaluation: relevance metrics, benchmarking
Desired profile
- Level: final-year Master’s degree or engineering school
- Strong skills in Python and software development
- Knowledge of machine learning and NLP
- Interest in and experience with conversational AI
- Prior experience with chatbot projects
- Rigor, autonomy, and a proactive attitude
Duration: 4–6 months
Technologies: Python, NLP, RAG, vector databases, LLMs