Shan Wijenayaka

Lead AI Engineer | AI Platforms & Distributed Systems

From sub-100ms derivatives trading to sub-600ms real-time voice AI - building production systems where latency and correctness aren't negotiable.

AI Platforms
p95 <600msvoice AI latency
Distributed Systems
35k/daytrades processed
Cloud & Reliability
99.95%platform uptime
Scale & FinTech
37Musers reached
Leadership
8+ years3+ years AITech Lead scope
01.

Experience

Sep 2025 - Present

Lead AI/ML Engineer

AI Platform & Security Operations

Certis, Singapore

Joined to build Certis's AI platform from the ground up - designing and shipping a multi-service system spanning real-time voice AI, computer vision, and centralized model governance for enterprise security operations.

  • Built a real-time Voice AI pipeline (STT, agent orchestration, TTS) with safety guardrails and human handoff, achieving p95 latency under 600ms at production scale
  • Shipped computer vision workflows for operational triage - inference services, audited pipelines, and human-in-the-loop review - replacing manual incident classification
  • Defined the platform architecture, grew the team through hands-on hiring, and set engineering standards - enabling the team to ship voice AI and CV services to production independently
PythonLangGraphLangChainPyTorchAWS EKSOpenSearchpgvectorFastAPI
Sep 2024 - Sep 2025

Senior Software Engineer

Regulatory Intelligence Platform

RegASK, Singapore

Sole architect on a production LLM-powered regulatory platform enabling search and Q&A over 1M+ regulatory documents, owning the full stack from retrieval pipeline to deployment.

  • Built evaluation and regression harnesses that gated every deployment, catching answer-quality regressions before they reached production in a compliance-critical environment
  • Diagnosed reliability bottlenecks through targeted instrumentation, improving uptime from ~99.5% to ~99.95% and reducing error rates by ~20%
  • Right-sized compute and tuned auto-scaling policies, cutting cloud spend by ~35% with zero SLA regressions
PythonFastAPILangChainRAGMongoDB Atlas Vector SearchAWS EKSReactTypeScript
Feb 2023 - May 2024

Senior Software Engineer

Derivatives Trading Systems

TP ICAP, Singapore

Replaced legacy batch-oriented trade processing with low-latency event-driven microservices in Go at the world's largest interdealer broker, with strict consistency and ordering guarantees across OTC derivatives.

  • Designed the trade processing pipeline end-to-end, handling ~35k trades/day at sub-100ms service-level latency for interest rate and currency swap instruments
  • Led zero-downtime migration of 12+ services from Java/MSSQL to AWS-native stack (Kafka/MSK, DynamoDB Streams, Aurora PostgreSQL), choosing incremental extraction over big-bang cutover to maintain zero trading interruptions throughout
  • Designed and integrated ML-based anomaly scoring into pre-clearing, building explainable flags that traders and operations could act on directly - reducing downstream breaks
  • Instrumented the full service mesh with OpenTelemetry, cutting unknown tail-latency causes by ~40% and establishing SLI/SLO baselines for each service
GoJavaKafka/MSKDynamoDBAurora PostgreSQLAWS EKSgRPC
Dec 2021 - Feb 2023

Senior Software Engineer

Consumer Dining Platform

Chope, Singapore

Modernized the backend of a high-traffic reservation platform, migrating from monolith to microservices while keeping real-time booking and availability flows running across APAC markets.

  • Defined service boundaries, rate-limiting, and caching strategies that maintained latency SLAs under peak dinner-hour traffic spikes
  • Built microservices for real-time restaurant availability, booking, and waitlist orchestration serving concurrent users across multiple markets
  • Led monolith decomposition into independently deployable services, enabling per-team ownership and faster release cycles
GoPythonPostgreSQLAWSNode.jsTypeScript
2017 - 2021

Wiley Global Technology / Sysco LABS Sri Lanka / Omobio (Pvt) Ltd.

Progressed from associate to senior engineer across Sysco LABS (Sysco Corporation tech arm), Wiley, and Omobio. Built backend platforms across enterprise and telco domains, including a USSD platform from scratch for Robi Bangladesh reaching 37M+ subscribers.

JavaSpring BootReactNode.jsAWSPythonMySQLPostgreSQL
02.

Expertise

AI & Machine Learning

LLM Orchestration (LangChain, LangGraph)Retrieval-Augmented Generation (RAG)Agentic Workflows & AI AgentsLLMOps & Model RoutingComputer Vision PipelinesVoice AI (STT/TTS)AI Safety & GuardrailsEmbedding Models & Vector Search (OpenSearch, pgvector, MongoDB Atlas)Fine-tuning (LoRA/QLoRA)PyTorchLLaMAPrompt EngineeringMultimodal PipelinesModel Evaluation & Regression HarnessesRBAC, Audit Logging, Human-in-the-Loop

Frontend & Full-Stack

React / Next.jsTypeScriptNode.js / ExpressREST API DesignWebSocket / WebRTCTailwind CSSResponsive DesignHTML5 / CSS3

Languages

GoPythonTypeScript / Node.jsJavaJavaScriptSQLRust (fundamentals)

Distributed Systems & Backend

System DesignEvent-Driven Architecture (Kafka/MSK)Microservices DesignAPI Design & REST/gRPCConcurrency & Performance TuningWebSocket / Real-time CommunicationMessage Queue ArchitectureCQRS / Event SourcingDynamoDB StreamsLow Latency SystemsHigh Throughput ProcessingFastAPISpring BootDjangoExpress.jsRedis

Cloud-Native & Platform Engineering

AWS (EKS, Aurora, DynamoDB, MSK, S3)Kubernetes & HelmDocker & ContainerizationCI/CD (GitHub Actions, ArgoCD, Jenkins)TerraformGoogle Cloud Platform

Observability & Reliability

OpenTelemetrySLI/SLO Design & Incident ResponsePrometheus & GrafanaELK / DatadogLoad Testing & Performance EngineeringCapacity Planning

Databases & Storage

PostgreSQL / AuroraDynamoDBVector Databases (OpenSearch, pgvector)MongoDB / NoSQLMySQLRedis

Domain: FinTech & Trading

Derivatives Trading SystemsPre-clearing & Risk DetectionInterest Rate / Currency SwapsWeb3 & Blockchain FundamentalsFinTech InfrastructureRegulatory Compliance (RegTech)

Engineering Leadership

System Architecture & Design ReviewsTechnical MentoringHiring & Interviewing EngineersLeading Development TeamsTest-Driven Development (TDD)BDD (Cucumber)Agile / SAFeCode Review
03.

Certifications

AI & Machine Learning

Model Context Protocol: Advanced Topics
Anthropic (2026)
Generative AI: Fundamentals to Advanced Techniques
National University of Singapore (2025)
Generative AI with Large Language Models
DeepLearning.AI (2025)
Machine Learning Specialization
Stanford University (2024)
Advanced Learning Algorithms
Stanford University (2024)
Supervised Machine Learning: Regression and Classification
Stanford University (2024)
Machine Learning in Production
Stanford University (2024)

Programming & Systems

UCI
Concurrency in Go
University of California, Irvine (2024)
Oracle Certified Professional, Java SE 6 Programmer
Oracle (2016)
Advanced SQL for Query Tuning and Performance Optimization
LinkedIn Learning (2025)
Tuning Kafka
LinkedIn Learning (2025)
Software Architecture: Domain-Driven Design
LinkedIn Learning (2021)

Cloud & Infrastructure

AWS Certified Solutions Architect - Associate
Amazon Web Services (2024)
Distributed Load Testing Using Kubernetes
Google Cloud (2024)

Blockchain & Fintech

Web3 and Blockchain Fundamentals
INSEAD (2024)

Agile & Leadership

Certified SAFe 5 Agilist
Scaled Agile, Inc. (2020)
04.

Recommendations

What colleagues say

Recommended by engineering leaders and technical collaborators.

PJ
VN
VW
3 recommendations on LinkedIn
Read recommendations on LinkedIn
05.

Education

BSc (Hons) Computer ScienceSecond Class UpperUniversity College Dublin
2014 - 2018