Hugo Casero

AI / ML Engineering Team Lead

Profile

AI / ML Engineering Team Lead with 8+ years across NLP, applied ML, and production LLM and agentic systems. Currently leading a team of 5 at Medida, shipping RAG, document-grounded chatbots, agentic content-creation pipelines (LangChain + LangGraph), and GenAI video/image generation tools on AWS. Background in clinical / healthcare NLP, digital marketing, and verifiable-claim detection; co-author on three peer-reviewed clinical publications.

Professional Experience
01/2023 — Present
Team Lead — AI / ML Engineer
Medida — Remote/Madrid
  • Lead a team of 5 Data Scientists across sprint planning, technical strategy, hands-on coding, mentoring, and internal coding-practice sessions.
  • Built a production RAG tool over 3,000+ internal documents, evaluated with RAGAS, substantially improving documentation search and retrieval.
  • Shipped two LLM-based chatbots in production: a general chatbot with moderation and a document-grounded assistant serving ~200 internal users.
  • Designed and productionized a Google-ranking analytics pipeline processing ~4,000 datapoints per vertical on a 15-day cadence.
  • Built automated GenAI video and image generation pipelines, including a consistent-character image-generation flow.
  • Built an agentic content-creation pipeline (LangChain + LangGraph) for drafting, refining, and assembling long-form content.
  • Contributed to LLM-based SEO analysis, internal agents / assistants, web automation and scraping infrastructure, and Streamlit / Gradio reporting tools.
02/2022 — 01/2023
Senior AI / ML Engineer
Medida — Remote/Madrid
  • Trained tree-based and learning-to-rank models for SEO ranking prediction on published and unpublished content.
  • Ran feature-importance analysis to inform SEO strategy.
  • Built text translation, paraphrasing, summarization, scraping, and preprocessing pipelines.
  • Ran statistical significance testing on SEO ranking features using Bag-of-Words representations.
03/2019 — 02/2022
NLP Data Scientist
Savana — Remote/Madrid
  • Informally tech-led a sub-team of 4 Data Scientists across clinical NLP workstreams.
  • Built and deployed CNN, LSTM, and BERT-based NER models for medical entity detection, achieving F1 0.75–0.85 in production on thousands of clinical reports.
  • Owned the full NER pipeline end-to-end, from data annotation through training and evaluation to production deployment.
  • Built CNN-based classification models for medical reports.
  • Developed internal tooling for annotation and model evaluation.
09/2017 — 02/2022
Data Science Lecturer
MIOTI — Madrid
Lecturer for Data Science and Data Preprocessing across multiple Master's-level cohorts.
09/2012 — 10/2015
Software Engineer (R&D)
Gunnebo Deutschland GmbH — Munich, Germany
Backend, frontend, and embedded-systems development in the R&D department; built internal update and release-packaging tools.

Education
2016 — 2017
Master's Degree in Data Science
Univ. of Granada
2006 — 2011
Bachelor Degree in Computer Science
Univ. Pol. of València

Selected Publications
2024
Clinical utility of personalized reference intervals for CEA...
Clinical Chemistry and Laboratory Medicine (CCLM)
2023
Symptoms timeline and outcomes in ALS using AI
Scientific Reports - Nature
Links
Skills
  • Languages: Python, SQL
  • ML / Deep Learning: Keras, scikit-learn, XGBoost, CNNs, LSTMs, BERT, learning-to-rank, recommender systems
  • NLP / LLMs / GenAI: OpenAI APIs, AWS Bedrock, LangChain, LangGraph, agentic pipelines, RAG, RAGAS, NER, classification, summarization, sentiment, text-to-image, text-to-video
  • Search & Data: Elasticsearch, embedding-based retrieval, Bag-of-Words
  • Cloud & MLOps: AWS, S3, EC2, Bedrock, Docker, Airflow, MLflow, CI/CD
  • Web & APIs: FastAPI, Streamlit, Gradio
  • Leadership: Team Lead, sprint planning, technical strategy, mentoring, coding-practice sessions
Languages
  • Spanish (Native)
  • English (Professional)
  • German (Basic A1)