Overview Retrieval-Augmented Generation (RAG) is quickly becoming a standard design pattern for grounding Large Language Models (LLMs) with external knowledge. Instead of relying solely on the training data of the model (which may be outdated or incomplete), RAG retrieves relevant facts from a knowledge base and passes them into the prompt. This ensures responses are…
GenAI, Agentic Workflows & Knowledge Base with Amazon Bedrock
Overview Incorporating Generative AI (GenAI) into your anomaly detection process can make workflows more adaptive, context-aware, and capable of reasoning. This project showcases how GenAI can be leveraged to create predictive maintenance workflows using Amazon Bedrock. Retrieval Augmented Generation (RAG) with knowledge bases With RAG (Retrieval-Augmented Generation), you can enhance foundation models using company-specific data…
Amazon Rekognition – Computer Vision
Overview AWS offers 20+ AI services, one of them is Amazon Rekognition. I develop a data science template using Amazon Rekognition to demonstrate how an ML-Ops framework integrates with managed AI services. Packaged as a CloudFormation stack and deployed via AWS Service Catalog, it delivers a fully integrated, end-to-end CI/CD pipeline for image classification and…
Reinforcement Learning
Overview This template is a starting point for building and operating Reinforcement Learning (RL) training and serving on AWS. Using demand and profit optimization as examples, it implements a Deep RL workflow to illustrate the end-to-end ML-Ops framework—covering orchestration, data pipelines, training, evaluation and deployment. The document discusses RL at a high level, but its…
AWS Sagemaker AI Studio User Journey
Overview This post summarizes the user journey for leveraging the ML-Ops framework within SageMaker Studio. The framework is built on the AWS sample project MLOPS-SM-PROJECT-TEMPLATE-RT. The solution architecture is illustrated below: This sample project features a self-mutating CDK pipeline (shown below) that deploys a Service Catalog application to the client’s account. The Service Catalog provides…
About Me
I’m Pluto Gasanova — a Machine Learning & MLOps Engineer who thrives on big challenges and bold goals. By day, I architect end-to-end AI systems in the cloud; by choice, I push my limits in Ironman triathlons and ultra-endurance cycling. In tech, I’ve built scalable ML frameworks on AWS that take models from concept to…
Let’s Connect
Got a question about AI, MLOps, machine learning, or LLMs? Curious about triathlons, endurance racing, or training tips? Or maybe you just want to say hi — I’d love to hear from you! Drop me a message via: Email: pgasanova@gmail.comLinkedIn: https://www.linkedin.com/in/pgasanova/ Or use the form below to reach out directly. I promise to get back…
MLOps Orchestrator with AWS Sagemaker AI
Overview The AWS Sagemaker AI framework uses CodePipeline as the workflow orchestrator. It manages tasks such as building, deploying, testing, preparing data, training models, running inferences, and evaluating results. We define this pipeline using Python CDK in a CloudFormation template, and it operates within the same AWS account. The above diagram illustrates the following sequence:1….
Hiking to Snowdon Summit – Spring 2025
Last spring, I set out on an unforgettable journey to the summit of Snowdon, the highest peak in Wales, starting from the village of Llanberis. I wasn’t alone on this adventure—two Argentinian friends, both seasoned hikers from near the Andes, joined me. Their experience with rugged, high-altitude treks added a special energy to the day….
Santini 600 : Cycling 600 km in 24 hours for a Good Cause
It started from a cordial chat over coffee at Ciclo e cafe, Changi Village in Nov 2021, Amos the cafe owner and Santini the official sponsor of Tour de France 2022 and Ironman races all over the world, presented us with opportunity yet an upheaval challenge to complete cycling 600 km within 24 hours on boxing day to…