Lead Engineer / Backend & AI / Mansoura, EG

Mohamed
Yousry

I build production AI systems, scalable backends, and automation that actually ships. I run models on hardware they weren't meant for, put most of what I build out in the open source, and care a little too much about systems that stay up. Right now I'm leading engineering at Drive-Nova.

Mohamed Yousry
NowLead Engineer at Drive-Nova (UAE)
PrevPillar (Fractional CTO), MyAly.AI, TotallyNot, and freelance since 2020
01

Where I've worked

since 2020
Apr 2026 - Present
Current

Lead Engineer

Drive-NovaHybrid, UAE

Own the architecture and delivery across B2B and B2C platforms in the tyre industry.

  • Lead engineering across B2B and B2C platforms, including managed Magento storefronts, owning the architecture, the delivery, and keeping it all running.
  • Architecting the AI side of the roadmap: agent-to-agent (A2A) architecture and multi-agent frameworks like AG2.
  • Run integrations and the technical relationships with partner platforms across Europe, Egypt, and the UAE.
ArchitectureA2AAG2MagentoIntegrations
May 2025 - Present
Current

Fractional CTO & Lead AI / Backend Engineer

Pillar Construction GroupRemote, USA

Built a full-stack AI procurement platform from the architecture up to production.

  • Designed and shipped an AI procurement platform that aggregates 50,000+ federal (SAM.gov) and SLED (HigherGov) opportunities, with multi-schema PostgreSQL, semantic search over embeddings, and an event-driven processing system.
  • Built ELI, a conversational assistant with 24 procurement tools (opportunity search, document Q&A, board management) on GPT-5 / GPT-4o, plus a 4-tier matching pipeline and a PaddleOCR pass for scanned government forms.
  • Architected the microservices: FastAPI, Celery with 6 specialized workers, Redis, and Supabase. Shipped the whole production stack on Docker, Coolify, Nginx, Cloudflare, GitHub Actions, and Sentry.
FastAPICeleryRedisSupabasepgvectorPaddleOCRDocker
Feb 2025 - Apr 2025

AI Engineer & Automation Developer

MyAly.AIRemote, USA

Owned payments and infrastructure for an AI marketing product.

  • Built and maintained the Stripe stack: subscriptions, webhooks, and secure checkout.
  • Wired frontend flows into the backend APIs and tightened up performance along the way.
  • Set up and ran the infrastructure: domains, DNS, Nginx, and Git workflows.
StripeFastAPINginxDNSGit
Sep 2024 - Mar 2025

Co-founder & AI Engineer

TotallyNotRemote, USA

Co-founded a company building local, on-device AI.

  • Led the AI features for a local, on-device AI product built with Flutter.
  • Designed and deployed the scalable backend on FastAPI, PostgreSQL, and Docker.
  • Set up the CI/CD pipelines that kept deploys boring (which is the goal).
On-device AIFlutterFastAPIPostgreSQLDocker
Jan 2020 - Present
Current

Developer

FreelanceRemote

The long-running thread. Shipping production systems end to end since 2020.

  • Architected and deployed large systems end to end: AI pipelines, OCR systems, mobile apps, web platforms, and automation bots.
  • Built OCR and document-processing systems that handle preprocessing, extraction, validation, and clean structured output.
  • Ran the full infrastructure lifecycle: cloud setup, provisioning, Dockerized deploys, CI/CD, DNS, and production monitoring.
OCRAI pipelinesCloudDockerCI/CDMonitoring
02

Stuff I've built

mostly open source
strix-halo-unsloth
Getting ML to run on hardware it shouldn't
7Open source

The setup that actually gets PyTorch and Unsloth fine-tuning running on an AMD Strix Halo (gfx1151) with 128GB of unified memory. I tracked down the kernel regression and the exact ROCm versions that quietly break every other guide, then shipped prebuilt containers so nobody else loses the weekend.

PyTorchROCm 7.1UnslothLoRAPodmanLinux
CCloudS
Cloud storage that can't read your files
Open source

A self-hosted Drive where the server only ever holds ciphertext. Your keys are derived in the browser with Argon2id, sharing seals a file key to the recipient's X25519 key, and nothing sensitive leaves your machine in the clear. Dump the database and all you get is base64.

Next.jsFastAPIlibsodiumPostgreSQLMinIO
PorterOPS
A2A multi-agent, built for real
Open source

A multi-agent ops platform where a LangGraph DAG runs department agents over an async agent-to-agent message bus and streams the whole thing to a React dashboard over SSE. This is the A2A architecture from my resume, as working code you can read.

LangGraphFastAPIReactSSESQLite
illiterate
A grammar engine that checks the AI
Open source

A privacy-first grammar checker and rewriter in five languages. It runs LanguageTool's rules first, then an LLM, then validates the LLM against itself to kill the errors AI usually introduces. Flutter app, FastAPI backend, encrypted end to end.

FlutterFastAPILanguageToolDocker
CSphere
A VS Code style IDE run by 11 agents
In development

An IDE driven by 11 orchestrated MCP agents for coding, debugging, testing, and deploy, plus a free community marketplace for tools and extensions. Local-first by design: SQLite and ONNX embeddings run right inside Electron, with optional Supabase sync.

ElectronTypeScriptFastAPISQLiteONNXMCP
paperprobe
AI risk-of-bias for systematic reviews
In development

A multi-agent tool that scores research papers against the COSMIN risk-of-bias checklist, all ten boxes, with verbatim evidence quotes you can click straight to the right page in the source PDF. Built for real systematic reviews, not demos.

PythonFastAPIRAGEmbeddings
n8n-instagram-scraper
Reels in, transcripts out
4Open source

An Instagram Reels pipeline that downloads in full quality, strips metadata so it stays under the radar, transcribes with Whisper, ships to S3, and runs the whole flow through n8n. FastAPI backend, one-command Docker deploy.

FastAPIWhispern8nDockerAWS S3
RAG-Medical-Assistant
Clinical decision support
Open source

Pairs ML clustering with Llama 3.2 or Gemini to read patient data and surface severity, estimated cost, and triage priority. Runs fully local with Ollama or in the cloud, your call.

Pythonscikit-learnOllamaGeminiStreamlit
Pixeld
Photos into pixel-art guides
1Open source

Turns any image into a numbered pixel-art pattern for cross-stitch, Perler beads, or LEGO. Color clustering, optional dithering, multiple export formats. The fun one.

PythonFastAPIStreamlit
More from the workshopAll repos on GitHub
AI-PlaygroundLoad, quantize, and convert any LLM across GGUF, ONNX, and SafeTensors, then chat with it.AOE-Discord-botServer bot with auto-translation, Sheets-backed quotes, roles, polls, and moderation.
Private workPlus private client work I can't link: a government-procurement platform on SAM.gov with GPT-4, a speech-to-speech AI, crypto trading dashboards, a self-hosted Supabase stack, and a pile of Discord bots people actually use.
03

What I actually use

daily, mostly
Languages
PythonTypeScriptJavaScriptDartC++CC#PHPBash
Backend & APIs
FastAPIFlaskDjangoNext.jsRESTWebSockets / SSECeleryPydanticStripe
AI & ML
LLMsLangChainLangGraphMulti-agent (AG2, MCP, FastMCP)Llama.cppPyTorchTensorFlowFine-tuning (LoRA, Unsloth)WhisperPaddleOCREmbeddings (pgvector)ONNX RuntimeRAG
Databases
PostgreSQLSupabaseRedisMySQLSQLite
Mobile & Frontend
FlutterReactNext.jsElectronHTMLCSSTailwindFramer MotionVite
DevOps & Infra
DockerNginxCaddyLinuxCloudflareGitHub ActionsROCmSentrySelf-hosting
Bots & Embedded
Discord.pyTelegram botsArduinoIoTn8nSteam API
Education
New Mansoura University
Bachelor's, AI Engineering · 2021 - 2026 · GPA 3.3 / 4.0
04

Off the clock

yes, the garlic thing is real

I run ML on hardware it was never meant for, then write down what finally worked so the next person doesn't lose a weekend.

I'm an open-source person at heart. Most of what I build ends up public, and I lean on the community's work every single day.

I'd rather read the logs than guess. Most 'impossible' bugs turn out to be something boring I just hadn't looked at yet.

Off the clock it's music production and video editing. I edit better than I film, and yes, I can burn garlic in under two seconds.

05

Say hi

fastest: email

Let's build something that stays up.

The fastest way to reach me is email. I read everything and reply to most of it. Got a hard backend or AI problem? Even better.