Hugo Casero

AI / ML Engineering Team Lead

Profile

AI / ML Engineering Team Lead with 8+ years across NLP, applied ML, and production LLM and agentic systems. Currently leading a team of 5 at Medida, shipping RAG, document-grounded chatbots, agentic content-creation pipelines (LangChain + LangGraph), and GenAI video/image generation tools on AWS. Background in clinical / healthcare NLP, digital marketing, and verifiable-claim detection; co-author on three peer-reviewed clinical publications.

Professional Experience

01/2023 — Present

Team Lead — AI / ML Engineer

Medida — Remote/Madrid

Lead a team of 5 Data Scientists across sprint planning, technical strategy, hands-on coding, mentoring, and internal coding-practice sessions.
Built a production RAG tool over 3,000+ internal documents, evaluated with RAGAS, substantially improving documentation search and retrieval.
Shipped two LLM-based chatbots in production: a general chatbot with moderation and a document-grounded assistant serving ~200 internal users.
Designed and productionized a Google-ranking analytics pipeline processing ~4,000 datapoints per vertical on a 15-day cadence.
Built automated GenAI video and image generation pipelines, including a consistent-character image-generation flow.
Built an agentic content-creation pipeline (LangChain + LangGraph) for drafting, refining, and assembling long-form content.
Contributed to LLM-based SEO analysis, internal agents / assistants, web automation and scraping infrastructure, and Streamlit / Gradio reporting tools.

02/2022 — 01/2023

Senior AI / ML Engineer

Medida — Remote/Madrid

Trained tree-based and learning-to-rank models for SEO ranking prediction on published and unpublished content.
Ran feature-importance analysis to inform SEO strategy.
Built text translation, paraphrasing, summarization, scraping, and preprocessing pipelines.
Ran statistical significance testing on SEO ranking features using Bag-of-Words representations.

03/2019 — 02/2022

NLP Data Scientist

Savana — Remote/Madrid

Informally tech-led a sub-team of 4 Data Scientists across clinical NLP workstreams.
Built and deployed CNN, LSTM, and BERT-based NER models for medical entity detection, achieving F1 0.75–0.85 in production on thousands of clinical reports.
Owned the full NER pipeline end-to-end, from data annotation through training and evaluation to production deployment.
Built CNN-based classification models for medical reports.
Developed internal tooling for annotation and model evaluation.

09/2017 — 02/2022

Data Science Lecturer

MIOTI — Madrid

Lecturer for Data Science and Data Preprocessing across multiple Master's-level cohorts.

09/2012 — 10/2015

Software Engineer (R&D)

Gunnebo Deutschland GmbH — Munich, Germany

Backend, frontend, and embedded-systems development in the R&D department; built internal update and release-packaging tools.

Education

2016 — 2017

Master's Degree in Data Science

Univ. of Granada

2006 — 2011

Bachelor Degree in Computer Science

Univ. Pol. of València

Selected Publications

2024

Clinical utility of personalized reference intervals for CEA...

Clinical Chemistry and Laboratory Medicine (CCLM)

2023

Symptoms timeline and outcomes in ALS using AI

Scientific Reports - Nature

Links

Skills

Languages: Python, SQL
ML / Deep Learning: Keras, scikit-learn, XGBoost, CNNs, LSTMs, BERT, learning-to-rank, recommender systems
NLP / LLMs / GenAI: OpenAI APIs, AWS Bedrock, LangChain, LangGraph, agentic pipelines, RAG, RAGAS, NER, classification, summarization, sentiment, text-to-image, text-to-video
Search & Data: Elasticsearch, embedding-based retrieval, Bag-of-Words
Cloud & MLOps: AWS, S3, EC2, Bedrock, Docker, Airflow, MLflow, CI/CD
Web & APIs: FastAPI, Streamlit, Gradio
Leadership: Team Lead, sprint planning, technical strategy, mentoring, coding-practice sessions

Languages

Spanish (Native)
English (Professional)
German (Basic A1)