LLM / Agent Platforms
- A2A framework design
- Multi-hop reasoning agents
- RAG
- Vertex AI / Azure OpenAI
AI / Cloud / Distributed Systems
Staff-level Software Engineer / AI Platform & Cloud Architect
Building production LLM platforms, edge AI systems, and distributed data infrastructure from prototype to enterprise deployment.
Profile Focus
About
I am a senior software engineer with 10+ years building production AI, computer-vision, and distributed systems end-to-end, from low-latency C++ edge inference engines to cloud-native LLM platforms.
My work sits at the intersection of AI platform architecture, RAG systems, edge-cloud deployment, and large-scale data pipelines on GCP and AWS, with repeated ownership from prototype through commercialized enterprise deployment.
Now · Updated Jun 2026
A snapshot of where my attention is right now, refreshed when things shift. Reach out if any of this overlaps with what you’re building.
Building
Sider — word-graph sync architecture and CEFR-aware review scheduling
Shipping
lennonlin.dev itself — static Next.js export on Cloudflare Pages with an origin-gated Notion OAuth Worker
Exploring
MCP server for Notion auto-import and richer Claude tool-use loops
Writing
Build log: Securing a Notion OAuth proxy on Cloudflare Workers
Expertise
Experience
Jan 2023 — Present
Acer — Advanced Tech BU
Lead Architect for the cloud-native AI Agent Platform on GCP. Designed a modular Agent-to-Agent (A2A) framework with dynamic runtime loading, multi-agent orchestration, real-time ASR, and large-scale RAG retrieval.
Jun 2019 — Jan 2023
Acer — Advanced Tech BU
Owned end-to-end engineering of a commercial edge-to-cloud AI platform deployed across retail and transportation. Built a real-time C++ inference engine with Cython/Python integration achieving sub-second latency in production.
Feb 2017 — Jun 2019
Acer — Advanced Tech BU
Simulation-driven model development. Built a virtual-to-physical feedback loop pairing GTA-V environments with real-time shared-memory inference for autonomous-driving perception and control.
Independent Projects
Personal builds that validate the same edge-cloud, on-device ML, and multi-LLM architectural patterns I use at work — except I own the product decisions end-to-end.
Mobile language learning · Word-graph vocabulary
A language-learning app organised around a graph-based Word Map: vocabulary nodes link by semantic, morphological, and contextual relationships so review sessions surface what's most reinforceable next. On-device personalization, no per-user backend cost.
Chrome + Firefox extension · multi-LLM routing
Browser extension that translates selected text via Gemini, Claude, or Azure OpenAI, classifies words by CEFR level, and exports vocabulary into Notion. Backed by a Cloudflare Worker acting as a secure OAuth proxy so the client never ships the Notion client secret.
Music practice tool · stem separation + AI beat align
A music-practice app for instrumentalists: metronome with feel control, tempo-preserving slowdown, on-device stem separation (Demucs), and AI-assisted beat alignment for jam loops. Free / Pro / Plus tiers, designed around real practice workflows.
Selected Work
01
A2A framework / RAG / multi-LLM routing
Architected a cloud-native AI platform on GCP with dynamic runtime agent loading, multi-agent orchestration, and multi-LLM routing across GPT-4 Turbo, Claude 3.5 Sonnet, and Gemini 2.0 Flash.
02
LLM workflow automation
Delivered a multilingual technical-translation agent with a four-stage verification pipeline covering exact match, AI review, generalization, and human review.
03
Commercial edge-to-cloud computer vision
Led architecture and engineering of a commercial n:n face-recognition platform with real-time C++ inference, edge hardware optimization, and cloud verification services.
04
Spark / Hadoop analytics infrastructure
Re-architected global telemetry ingestion and preprocessing systems using Spark and Hadoop MapReduce for device analytics and OTA workflows.
Career Highlights
10+ years in software engineering
Built AI products from prototype to commercial deployment
Validated RAG quality with 3,131 human-verified QA pairs
Delivered 29-language translation automation at 99.9% accuracy
Achieved 97.24% MegaFace accuracy for edge face recognition
Reduced ETL runtime from 24 hours to 2 hours
Processed about 50M-90M telemetry CSV rows per day
Contact