available · full-time · june 2026 · usa
// hi, i'm

Venkata Naga Sri
Komatlapalli

Building production-grade AI systems that ship — from RAG pipelines and multimodal transformers to serverless AWS inference APIs. M.S. Data Science & AI · UCM · 3.77 GPA.

AI Engineer & Data Scientist·PwC Alumni·AWS Certified Architect·Kansas City, MO
View Projects → GitHub ↗ LinkedIn ↗ Get In Touch
0AI/ML Projects
0GPA · MS AI
0arXiv Papers RAG'd
0Articles Analyzed
0Industry Roles
01 · about

AI Engineer & Data Scientist

I don't just build models — I ship AI systems. My work spans the full stack: data pipelines and model training through RAG architectures, transformer fine-tuning, and cloud-native deployment on AWS.

Currently pursuing my M.S. in Data Science & Artificial Intelligence at the University of Central Missouri (GPA: 3.77/4.0), I previously worked at PricewaterhouseCoopers in AI Competency and HealthEdge Technologies as a QA Engineer — giving me a rare blend of industry rigor and cutting-edge AI research.

I'm focused on Generative AI, LLMs, RAG pipelines, and multimodal AI. My arXiv-RAG project benchmarked three embedding models achieving perfect MRR 1.000 with BGE. My ClarifyAI platform hits sub-2-second query latency on a fully serverless AWS architecture.

I hold an AWS Certified Solutions Architect – Associate (Mar 2026) and am actively contributing to research on self-supervised learning methods.

Looking for full-time AI Engineer, Data Scientist, or ML Engineer roles — available June 2026.

~/kvnsri_profile.json
{
  "name": "Venkata Naga Sri K.",
  "role": "AI Engineer / Data Scientist",
  "gpa": 3.77,
  "location": "Kansas City, MO",
  "open_to_work": true,
  "available": "June 2026",
  "stack": [
    "PyTorch", "AWS Bedrock", "LangChain",
    "FAISS", "Streamlit", "HuggingFace"
  ],
  "interests": ["LLMs", "RAG", "MLOps"],
  "certified": "AWS Solutions Architect",
  "papers_RAGd": 120 // arXiv ML
}
02 · arsenal

Tech Stack

⚡ Generative AI & LLMs
RAG PipelinesAWS BedrockLangChainPrompt EngineeringFine-TuningAI AgentsVector DBsFAISSHuggingFaceBERTDistilBERTGPT
🧠 Deep Learning
PyTorchTensorFlowKerasTransformersAttention MechanismsCNNsLSTMsWav2Vec2ViTTransfer Learning
📊 Machine Learning
Scikit-learnXGBoostRandom ForestSVMGradient BoostingFeature EngineeringHyperparameter TuningPCAA/B Testing
☁️ Cloud & MLOps
AWS S3LambdaAPI GatewayEC2BedrockCognitoRedshiftDockerKubernetesCI/CDTerraformGit
🗄️ Data & Databases
PythonSQLPostgreSQLMongoDBNeo4jDynamoDBPandasNumPyApache SparkETL Pipelines
📈 NLP & Visualization
TF-IDFEmbeddingsspaCyNLTKTokenizationTableauPower BIStreamlitPlotlyDash
03 · work

Featured Projects

★ FeaturedRAG + LLM Pipeline
MAR 2026
arXiv-RAG — Semantic Research Q&A System
End-to-end QA system over 120 arXiv ML papers using FAISS vector search and Google Gemini. Benchmarked BGE, MPNet, and MiniLM vs BM25 across chunk sizes and retrieval depths. Evaluated on 20 manually curated QA pairs measuring Recall@K, Precision@K, MRR, Faithfulness, and Answer Relevance.
MRR 1.000 (BGE)7.7× Answer Relevance120 Papers8ms Retrieval
PythonFAISSGeminiStreamlitBM25SentenceTransformersarXiv API
★ FeaturedProduction · AWS Serverless
NOV 2025 – DEC 2025
ClarifyAI — Serverless RAG Document Assistant
Production-grade serverless document intelligence platform on AWS. RAG pipelines with Bedrock Knowledge Base for semantic retrieval and LLM-powered Q&A. Secure role-based access via Cognito with React.js frontend. Fully cloud-native, multi-user architecture.
<2s Query LatencyAWS BedrockServerlessMulti-user
AWS BedrockLambdaS3API GatewayCognitoReact.jsRAG
🎭 Deep Learning · Multimodal Fusion
OCT 2025 – NOV 2025
Multimodal Emotion Recognition via Cross-Modal Transformers
Multimodal emotion classification on MELD dataset fusing text (BERT), audio (Wav2Vec2), and visual (ViT) features. Custom cross-modal Transformer encoder with 4 attention layers and 8 heads. Class-weighted loss for imbalanced emotion distribution.
Accuracy · 59.96%Weighted F1 · 60.66%3 Modalities7 Classes
PyTorchBERTWav2Vec2ViTHuggingFaceMELD
📰 NLP · Transformers vs Classical
OCT 2025
Fake News Detection — Classical NLP vs Transformers
Systematic benchmark of TF-IDF + Logistic Regression vs fine-tuned DistilBERT on 65,000+ articles. Comprehensive precision, recall, F1, and ROC-AUC comparison. Full data preprocessing and cleaning pipelines with binary fake news classification.
65K+ ArticlesDistilBERT · 83%Classical · 94%ROC-AUC Benchmarked
DistilBERTTF-IDFScikit-learnNLTKHuggingFace
🏥 ML · Healthcare AI
MAR 2025 – APR 2025
Disease Prediction & Specialist Recommendation
ML ensemble predicting diseases from symptoms. 132-feature binary matrix covering 41 diseases and 131 symptoms with majority voting across LR, RF, SVM, KNN, and Naive Bayes. Ranked 563 doctor profiles by weighted satisfaction and availability scores.
98% Test Accuracy41 Diseases563 Doctor ProfilesMajority Voting
PythonScikit-learnRandom ForestSVMNaive Bayes
🌱 Sustainability · Analytics
NOV 2024 – DEC 2024
CarbonClean — Measure. Manage. Minimize.
Carbon footprint tracking platform enabling individuals and organizations to measure emissions, set reduction targets, and visualize environmental impact over time. Sustainability-first design with interactive dashboards and actionable insights.
Emissions TrackingInteractive DashboardGoal Setting
PythonData VisualizationAnalytics
04 · experience

Professional History

AUG 2023 — JUL 2024
Associate — AI Competency, Data & Analytics, Advisory
PricewaterhouseCoopers (PwC) · Hyderabad, India
  • Built foundational expertise in AI, GenAI, analytics, and data technologies through structured training in Big Data, Python, RDBMS, MongoDB, Tableau, and hypothesis testing.
  • Developed UI components using Angular and TypeScript for a frontend project, improving application functionality and user experience.
  • Applied data analysis and analytical reasoning to interpret business datasets and support advisory project outcomes.
  • Collaborated in agile development sprints using Jira for task tracking, stand-ups, sprint planning, and retrospectives.
  • Leveraged Git and CI/CD workflows to support frontend delivery in a fast-paced consulting environment.
Python · SQL · MySQL · PostgreSQL · Angular · TypeScript · MongoDB · Tableau · Power BI · Git · Jenkins · GenAI · Prompt Engineering
FEB 2023 — AUG 2023
Quality Assurance Intern
HealthEdge Technologies · Bangalore, India
  • Performed QA validation of HealthRules Payer modules verifying claims workflows, business rule execution, and healthcare member data accuracy.
  • Automated end-to-end test workflows using Cypress for REST and SOAP API validation, reducing regression test execution time by 40%.
  • Executed backend data validation using SQL queries for healthcare member data integrity in HealthRules Payer application modules.
  • Developed test plans, regression scripts, and defect reviews using Git, Jenkins, Maven, JUnit, and TestNG.
SQL · MySQL · PostgreSQL · Cypress · REST APIs · SOAP APIs · Git · Jenkins · Maven · JUnit · TestNG · Python · Linux
05 · education

Academic Background

AUG 2024 — MAY 2026
Master of Science
Data Science & Artificial Intelligence
University of Central Missouri · Warrensburg, MO
GPA: 3.77 / 4.0 · In Progress
AUG 2019 — MAY 2023
Bachelor of Technology
Computer Science & Engineering
Shri Vishnu Engineering College · India
CGPA: 8.52 / 10.0 · First Class

Certifications

AWS Certified Solutions Architect – Associate
Amazon Web Services · Mar 2026
Generative AI Mastermind
OutSkill · Feb 2026
Introduction to Generative AI Learning Path
Google Cloud / Coursera · Nov 2025
Advanced Python & Machine Learning Training
WISE Program · TalentSprint · Accenture · 2020–2021

Research & Presentations

Self-Supervised Learning Methods Evolution
Research Paper in Progress · Dec 2025 – Present
IEEE TASLP — NLP Paper Study & Presentation
Transactions on Audio, Speech & Language Processing · Feb 2026
BERT: Pre-training of Deep Bidirectional Transformers
Deep Learning Study & Paper Presentation · Jun 2025
06 · contact

Let's Connect

Actively seeking full-time AI Engineer, Data Scientist, and ML Engineer roles in the United States. Available to start June 2026. Let's build something that matters.

✉️ Send a Message ↗
KVN
Venkata Naga Sri K.
AI Engineer & Data Scientist · MS AI · UCM
Machine LearningLLMsNLPDeep LearningAWSRAGTransformersMLOps
Open to Opportunities
Send a Message ↗