Pinned · Rustem Feyzkhanov in AWS in Plain English
Serverless compute for LLM — with a step-by-step guide for hosting Mistral 7B on AWS Lambda
One of the challenges with using LLMs in production is finding the right way to host the models in the cloud. GPUs are expensive so hosting…
6 min read · Nov 16, 2023

Pinned · Rustem Feyzkhanov in AWS in Plain English
Guide for running Llama 2 using LLAMA.CPP on AWS Fargate
Step-by-step guide for deploying a Llama 2 model to AWS using LLAMA.CPP as the framework, Fargate for hardware, and Copilot for deployment.
5 min read · Oct 17, 2023

Pinned · Rustem Feyzkhanov in AWS Tip
How AWS Lambda SnapStart eliminates cold starts for Serverless Machine Learning Inference
How AWS Lambda SnapStart removes cold starts for serverless machine learning inference by loading the model into the snapshot.
4 min read · Nov 29, 2022

Rustem Feyzkhanov in AWS in Plain English
Making LLMs Scalable: Cloud Inference with AWS Fargate and Copilot
In this blog post, I’ll take you through a step-by-step guide on how to get LLMs up and running in the cloud with AWS Fargate and Copilot.
5 min read · Aug 7, 2023

Rustem Feyzkhanov in AWS Tip
Scalable Cloud inference endpoint using ONNX and AWS Fargate
As a Machine Learning Engineer, I often find myself deploying models to the cloud. It’s an essential part of getting our models out of the…
6 min read · May 22, 2023

Rustem Feyzkhanov
ML No/Low Code services and Use Cases @ Re:Invent 2022
Machine Learning is becoming essential for a lot of industries, but setting ML projects up for success is challenging and requires business…
4 min read · Nov 14, 2022

Rustem Feyzkhanov
12 MLOps breakout sessions I’m looking forward to at Re:Invent 2022
MLOps is a field of best practices for companies to run ML workflows in production. It encompasses a large stack of tasks from optimizing…
4 min read · Nov 8, 2022

Rustem Feyzkhanov
7 things to know before using AWS Panorama
Machine learning is becoming essential for a lot of companies, and they want to use it to optimize their operations and make new services…
4 min read · Nov 26, 2021

Rustem Feyzkhanov
7 MLOps breakout sessions I’m looking forward to at Re:Invent 2021
MLOps is an emerging field of best practices for businesses to run Machine Learning workflows in production. MLOps captures a very wide…
3 min read · Nov 16, 2021

Rustem Feyzkhanov in AWS Tip
Machine Learning Inference on AWS Lambda Functions powered by AWS Graviton2 Processors
Machine Learning has become a necessity for a lot of companies, from Fortune 500 companies to small startups. With all the frameworks and…
5 min read · Sep 29, 2021