PhD in Computer Science & AI · AI Engineer
Building production-ready multimodal AI systems and scalable architectures. Specializing in Generative AI, language models, and explainability. PhD completed with Summa Cum Laude.
I hold a PhD in Computer Science and Artificial Intelligence from the University of Murcia, awarded with **Summa Cum Laude** honors. Over the past 4+ years, my work has centered on bridging the gap between cutting-edge AI systems and robust, production-ready software engineering.
My expertise focuses on developing multimodal architectures (combining audio processing with Large Language Models), orchestrating scalable data pipelines, and implementing Explainable AI (XAI) frameworks. I prioritize building reliable, high-availability microservices over simply chasing theoretical benchmarks.
Currently seeking: AI Engineer, Machine Learning Engineer, or Senior Python Developer roles in high-impact environments. Immediately available for international remote contract opportunities (B2B) or relocation.
Designed and deployed a decoupled microservices architecture (Python, FastAPI, Docker, Celery) for multimodal AI inference, optimizing processing latency across local and cloud infrastructure. Built end-to-end audio processing pipelines integrating ASR (Whisper), speaker diarization, paralinguistic feature extraction, and semantic vector embeddings (FAISS). Applied Explainable AI (XAI) frameworks utilizing SHAP values to interpret fine-tuned BERT models, delivering complete system transparency. Published 5 high-impact, first-author research papers in venues like IEEE Access and Applied Sciences.
Provided L2 technical and architectural support for enterprise clients on Google Cloud Platform. Handled deep troubleshooting for production workloads involving Compute Engine, core VPC networking, IAM, and Google Kubernetes Engine (GKE). Collaborated with SRE teams to analyze and fix root-cause infrastructure degradation.
Deployed an industrial real-time CNN for automated defect detection, achieving 98% accuracy and sub-100ms inference latency running directly on resource-constrained embedded hardware. Developed optimized LSTM-based NLP systems for semantic complexity analysis. Built and maintained automated, multi-source ETL pipelines for tabular, image, and raw audio data assets.
View my complete publication list on Google Scholar
Doctoral dissertation engineering a scalable multimodal AI ecosystem designed to process, segment, and interpret heterogeneous classroom variables with a fundamental focus on architectural explainability.
View Thesis →Combined text (BERT) and audio features to classify different types of teacher interventions in classrooms.
View Publication →Systematic review of 82 studies (2014-2024) on using audio processing in educational research.
View Publication →Explored techniques to make multimodal AI models generalize better across different classroom settings.
View Publication →Built a system to automatically analyze classroom recordings and generate insights for teachers.
View Publication →Analyzed how student response systems work in real classrooms using multimodal AI.
View Publication →Seeking engineering challenges in Generative AI, system deployment, and multimodal architectures. Open to B2B contracting or global relocation opportunities.