Javier Jiménez — Data Scientist
for real projects & play

I build LLM systems that are explainable, reliable and fun—especially when they power gaming & esports experiences.

🎮 AI × Gaming: Game-master agents, MTG rules assistants, and gameplay automation. 🤖 LLM systems: RAG, tool-use, multi-agent orchestration, evaluation & guardrails. Stack: Python, Hugging Face, LangGraph/LangChain, Numpy/Pandas, Streamlit, Azure/AWS, Docker
Javier's Profile
🤖
🎮
📊

About Me

Data Scientist focused on NLP/LLMs for gaming and esports applications

I'm a Data Scientist focused on NLP/LLMs who loves applying AI to gaming and esports. My path runs from research to startup work and into production LLM systems. I design small, testable components (retrievers, planners, tools, evaluators) and stitch them into robust pipelines, then I show them off with demos, not dashboards. I particularly enjoy problems that mix rules, strategy, and interaction, like a tabletop GM or a competitive card game bot.

Professional Journey

My evolution from academic researcher to AI entrepreneur, and now specializing in LLMs and GenAI

2024 - Present
🤖

Senior Data Scientist

Plain Concepts (Nestlé AI Team)

Working with Nestlé's AI research team through Plain Concepts to implement cutting-edge LLM solutions.

  • • Retrieval-Augmented Generation (RAG) systems
  • • AI Agents development and deployment
  • • Azure cloud solutions and Docker containerization
🚀
2022 - 2024

Founder & Data Scientist

Canonical Green

Developing expertise in cloud infrastructure, AI optimization, and scalable ML solutions through real startup experience.

  • • AI optimization for maritime sector applications
  • • Cloud infrastructure and ML deployment
  • • Startup foundation and technical leadership
2021 - 2022
🔬

AI Researcher

University of Cádiz (UCA)

Research in NLP and text analysis, building solid foundations in machine learning and academic methodology.

  • • NLP and text analysis research
  • • Machine learning fundamentals and applications
  • • Research methodology and academic writing

Flagship Projects

AI × Gaming/Esports

Mini case studies of my LLM systems for gaming applications

🏆

OwlBear LLM Chat — AI Game Master (MCP / multi-agent)

Goal

Make a conversational Game Master that plans, retrieves lore, manages the tabletop (dices, tokens, fog), and narrates scenes for tabletop RPGs.

Approach

Langgraph ReAct agent that uses MCP tools connected with the OwlBear API.

What I built

  • React Agent to interact with the tabletop.
  • Chat to talk with the players to manage lore and actions.
  • MCP server on Gradio with custom tools for dice, tokens, fog of war, rules, lore retrieval, and scene narration.
Tech Stack: Python, LangGraph, MCP, Anthropic, Gradio, Hugging Face
🃏

Gatherer Sage — Magic: The Gathering rules & strategy assistant

Goal

Provide rules-correct answers and card/line analysis for MTG, with sourceable citations.

Approach

Domain index of r/mtgrules Q&A and card text; retrieval + constrained generation for rules; optional fine-tuning experiments for jargon.

What I built

  • Data ingestion + cleaning with SoTA techniques (cards, ruling PDFs, forum Q&A).
  • Retrieval pipeline with rules-first prompts (RAG-style).
  • Fine-tune models on custom MTG data for jargon and strategy with huge amounts of data.
Tech Stack: Python, Hugging Face, vLLM, Pinecone, PyTorch
🧙‍♂️

JokerNet — Gameplay automation & CV

Goal

Explore computer-vision + LLM agents for gameplay automation in Balatro rogue like.

Approach

Screen capture → detection/classification → action planner → action executor

What I built

  • Docker environment to run the game and interact with it, in a secure way.
  • Feature extractors and heuristics for state; simple rule-policies for actions.
  • Complex LangGraph agent to plan actions from state, with tool-use for multimodal LLMs.
Tech Stack: Python, LangChain, LangGraph, OpenAI, Streamlit, Docker

Other projects

🎌

Animdle

Wordle-style anime guessing game with OP/ED clips.

Play

Publications

Peer-reviewed research in ocean engineering, AI, and computer vision

New analysis in the preliminary design for LNG and LPG ships

DOI: 10.1016/j.marstruc.2025.103863 Marine Structures 2025 Peer-reviewed

Authors: Mariano Marcos-Pérez, Javier Jiménez de la Jara, Daniel Precioso, Aurelio Muñoz, M. Victoria Redondo-Neble, David Gómez-Ullate

Official abstract (verbatim):

"In recent years, the production of LNG and LPG ships has increased, driven by the increasing demand for natural and petroleum gases. To meet this demand, ship designs must be optimized at all stages, requiring designers to have advanced, efficient and accurate design tools. This study presents new statistical and regression analyses of data extracted from the Hyundai Heavy Industries Shipbuilding Group catalogue which includes information on LNG and LPG ships built between 1979 and 2023. The database includes 145 LNG carriers and 322 LPG carriers, representing approximately 20% of the global gas carrier fleet. Simple and multiple regression analyses were used to estimate dependent variables and their correlation to other ship parameters. In addition, Machine Learning algorithms were trained and compared against these traditional methods. This study provides updated tools to support the preliminary design of LNG and LPG ships, enhancing the understanding and accuracy of early-stage design decisions."

My contribution:

Co-developed the ML baseline and regression comparisons; helped analyze dimensional ratios and trends used in preliminary design tools.

HADAD: Hexagonal A-Star with Differential Algorithm Designed for weather routing

DOI: 10.1016/j.oceaneng.2024.120050 Ocean Engineering 2025 Peer-reviewed

Authors: Javier Jiménez de la Jara, Daniel Precioso, Louis Bu, M. Victoria Redondo-Neble, Robert Milson, Rafael Ballester-Ripoll, David Gómez-Ullate

Official abstract (verbatim):

"We present HADAD (Hexagonal A-Star with Differential Algorithm Designed for weather routing), a novel optimization algorithm for weather routing. HADAD conducts a global exploration using an A⋆ search on a hexagonal grid with higher-order neighbors, enhancing directional flexibility and overcoming limitations of traditional graph searches that constrain vessel movements. It then refines the solution using a discrete Newton–Jacobi variational method, ensuring convergence to a locally optimal, smooth route in continuous space. To evaluate the effectiveness of HADAD, we developed a benchmark comprising 1,560 instances over a full year, varying in origin–destination pairs, vessel speeds and oceanographic conditions. Our results show that HADAD outperforms pure A⋆ graph search methods by an extra 4% savings with respect to the shortest-distance route, thanks to more flexible smoother trajectories obtained by gradient descent. In our seasonal study we observe that the savings distribution shows large seasonal variations (double savings on average in winter with respect to summer) and contains a significant number of outliers. Savings reach 27% in these cases of extreme weather events. Validation of the algorithm performed with synthetic vector fields has been conducted. In this setting, the algorithm has been adapted to handle fuel consumption optimization for Just-in-Time arrival. By integrating global search and local optimization, HADAD effectively balances computational efficiency with route optimality, offering a practical and adaptable solution for real-world weather routing applications."

My contribution:

First author—designed the hex-grid A* + FMS pipeline, implemented the optimizer, and built the year-long 1,560-instance benchmark & analysis.

NeoCam: A hybrid edge-cloud platform for non-invasive real-time monitorization of preterm neonates in intensive care units

DOI: 10.1109/JBHI.2023.3322078 IEEE Journal of Biomedical and Health Informatics 2023 Peer-reviewed

Authors: Ángel Ruiz-Zafra, Daniel Precioso, Blas Salvador, Simón P. Lubián-López, Javier Jiménez, Isabel Benavente-Fernández, Janet Pigueiras, David Gómez-Ullate, Lionel C. Gontard

Official abstract (verbatim):

"The number of babies born prematurely is rising every year... In this work we present NEOCAM, an open source platform conceived to be deployed in NICUs for the real-time non-invasive monitorization of preterm in incubators or beds using edge-computing devices... We show how information of preterm limb's activity, face emotion and breath rate can be measured continuously in real time by processing video with three AI-based models running in parallel and dedicated algorithms designed to be flexible and robust."

My contribution:

Co-developed AI/computer-vision components and focused on baby emotion tracking. Integrate all the AI models into a real-time edge-computing system.

Awards

Recognition in AI and technology competitions

🏆

Finalist — Gradio MCP Hackathon (OwlBear GM)

(2025)

Finalist in the international Model Context Protocol hackathon with the OwlBear LLM Game Master project.

View details
🌊

2nd place — Ocean Hackathon

(2021)

Global competition for innovation in ocean technologies. Converted in a startup (Canonical Green).

View details
👁️

2nd place — OpenCV AI Competition

(2021)

Global competition in computer vision and OpenCV applications with NeoCam project

View details

Contact

Simple & reliable ways to get in touch

Quick Contact

Direct contact is preferred for reliability. Choose your preferred method below.

Send me a message

Note: This form may not work reliably on static hosting. Direct email is recommended.