Reval Logo
Home Browse Talent Skilled in Data Preprocessing

Vetted Data Preprocessing Professionals

Pre-screened and vetted.

Data PreprocessingPythonDockerSQLAWSTensorFlow
SD

Surya Danturty

Screened

Intern AI/ML Engineer specializing in computer vision and time-series forecasting

Riverside, CA0y exp
University of California, RiversideUC Riverside

“Undergrad who built a production RAG chatbot for a messy college website using OpenAI embeddings + FAISS, overcoming hard-to-crawl/non-selectable site content and strict API budget limits. Applies information-retrieval best practices (section-based chunking with overlap, precision/recall evaluation) and reliability techniques (edge-case testing, similarity thresholds, fallback responses), and has experience scaling similar indexing work to ~300,000 Wikipedia pages.”

CPythonJavaJavaScriptSQLHTML+74
View profile
JK

Jitesh Kumar S

Screened

Junior Machine Learning Engineer specializing in NLP, computer vision, and MLOps

Lafayette, IN3y exp
YaarcubesUniversity of Maryland, College Park

“ML/LLM engineer with Meta experience building production AI systems for near real-time user-report classification and summarization under strict latency (<250ms), safety, cost, and privacy constraints. Has hands-on MLOps/orchestration experience (Airflow, Spark, MLflow, Kubernetes, Docker, GitHub Actions) plus observability (Prometheus/Grafana) and applies rigorous evaluation, staged rollouts, and A/B testing to keep agent workflows reliable in production.”

PythonSQLBashShell ScriptingJavaC+++99
View profile
MV

Mohith Venkata

Screened

Mid-level Full-Stack Developer specializing in cloud-native APIs and data workflows

Tukwila, WA4y exp
Reshmi’s Group Inc.Seattle University

“Built and owned end-to-end ordering and inventory/order management systems for a wholesale distributor, delivering an MVP quickly and iterating based on direct observation of daily users. Experienced with TypeScript/React + Node.js layered architectures and microservices using RabbitMQ, including real-world scaling issues (duplicates, backpressure) and observability practices (correlation IDs, structured logging).”

PythonJavaJavaScriptTypeScriptC++C#+147
View profile
NB

nitesh bommisetty

Screened

Mid-level Data Scientist specializing in ML, NLP, and LLM-powered solutions

Tampa, FL4y exp
LumenUniversity of South Florida

“AI/NLP-focused practitioner who built a zero-/few-shot LLM event extraction system on the long-tail Maven dataset, combining prompt-structured outputs with LoRA/QLoRA fine-tuning and rigorous F1 evaluation. Also implemented entity resolution/data cleaning pipelines and embedding-based semantic search using Sentence-BERT + FAISS, and has healthcare experience delivering a multilingual speech/translation mobile prototype using HIPAA-compliant Azure Cognitive Services.”

PythonRSQLTensorFlowPyTorchKeras+123
View profile
MM

Maheswar Mekala

Screened

Mid-level Machine Learning Engineer specializing in NLP, recommender systems, and MLOps

OH, USA5y exp
General MotorsUniversity of Dayton

“ML/LLM engineer with production experience at General Motors building Transformer-based search and recommendation personalization for a high-traffic vehicle platform. Delivered significant KPI gains (17% conversion lift, 14% bounce-rate reduction) and optimized real-time inference via ONNX Runtime and INT8 quantization while implementing robust MLOps (Airflow/MLflow, monitoring, drift-triggered retraining) and stakeholder-facing explainability/dashboards.”

PythonPandasNumPyScikit-learnSQLGit+101
View profile
AK

AneeshReddy Kusa

Screened

Junior Full-Stack Software Engineer specializing in React and FinTech

Hyderabad, India2y exp
CognizantUniversity of Cincinnati

“Full-stack engineer with banking-domain experience (Cognizant/Kotak) building and optimizing high-usage transaction/account APIs on Spring Boot/Node/PostgreSQL in AWS/Docker, including peak-load performance fixes. Also built an end-to-end retail demand-forecasting feature during a master’s program, spanning data pipelines, ensemble models, dashboards, and operational guardrails like validation and fallbacks.”

JavaTypeScriptJavaScriptSQLC++React+75
View profile
SA

sahithi A

Screened

Mid-level AI Engineer specializing in LLM agents and RAG for health-tech

Remote6y exp
Milton AITexas Tech University

“Backend engineer with health-tech AI platform experience who designed a modular FastAPI/PostgreSQL architecture supporting real-time user data and swap-in AI workflows. Has hands-on production experience with observability (CloudWatch, structured logging, LangSmith/LangGraph/LangChain tracing), secure auth (OAuth2/JWT, RBAC, RLS), and careful data-pipeline migrations using parallel runs and rollback planning.”

AgileAPI IntegrationAWSBackend DevelopmentCI/CDClassification+121
View profile
LP

Lakshmi Priya Ramisetty

Screened

Mid-level ML & Data Engineer specializing in GenAI, graph modeling, and fraud/risk analytics

Redwood City, CA5y exp
BlueArcYeshiva University

“Built a production AI fraud/risk scoring platform at BlueArc that ingests web business/product/site data, generates text+image embeddings, and connects entities in a graph to detect reuse patterns and links to known bad actors. Optimized for scale with incremental graph re-scoring and delivered investigator-friendly explainability by surfacing the exact signals/relationships behind each score; orchestrated workflows with Airflow and GCP event-driven components (Pub/Sub, Dataflow, Cloud Run) and has recent LLM workflow orchestration experience (retrieval, prompting, scoring).”

PythonSQLPySparkApache AirflowETLPostgreSQL+92
View profile
AK

Akshay Krishna Varma Buddharaju

Screened

Junior Machine Learning Engineer specializing in computer vision and generative AI

1y exp
INV TechnologiesKennesaw State University

“CoreAI intern at The Home Depot who improved the Magic Apron Assistant by building a production video ingestion + RAG retrieval system for long videos (uploads and YouTube), including a graph-based retrieval module to speed up and improve relevance. Experienced with Kubernetes orchestration (HPA) and production reliability practices like caching, monitoring, regression testing, and stakeholder-driven requirements.”

Automated TestingAWSBERTCC++CI/CD+84
View profile
SS

Sai somapalli

Screened

Senior LLM Engineer specializing in Generative AI, RAG, and multimodal assistants

USA6y exp
Stellar AI SolutionsCampbellsville University

“GenAI/NLP engineer with experience building classification and summarization pipelines in PyTorch and deploying multimodal GPT-4-style workflows. Has integrated LLM applications across OpenAI, Azure OpenAI, and Amazon Bedrock, and uses LangChain/LlamaIndex/Semantic Kernel to orchestrate RAG and agent workflows with production-focused evaluation metrics like task success rate and groundedness.”

Generative AILarge Language Models (LLMs)ClaudeLlamaLangChainRetrieval-Augmented Generation (RAG)+83
View profile
AG

Aravind Gudipudi

Screened

Mid-level AI/ML Engineer specializing in MLOps and cloud-deployed ML systems

Austin, TX3y exp
PurevisitxUniversity of Illinois Springfield

“ML/AI engineer who built and productionized an NLP system at PurevisitX, orchestrating end-to-end ML workflows with Airflow (S3 ingestion through auto-retraining) and optimizing for drift and low-latency inference. Also partnered with Citibank risk teams on a fraud detection model, translating results via dashboards and iterating thresholds based on stakeholder feedback.”

A/B TestingAgileApache AirflowAWSAWS GlueAWS Lambda+93
View profile
SK

SaiGanesh Konagalla

Screened

Mid-level ML Engineer specializing in NLP and Generative AI

Houston, TX4y exp
Epic SystemsUniversity of Central Missouri

“Healthcare AI/ML engineer with Epic experience who built and deployed a HIPAA-compliant GPT-4 RAG clinical assistant over large medical document sets, emphasizing privacy controls and low-latency performance. Also automated end-to-end retraining and deployment of patient risk models using orchestration/CI-CD (Jenkins, SageMaker, MLflow), cutting deployment time from hours to minutes while improving reliability.”

PythonNumPyPandasSciPyScikit-learnSeaborn+186
View profile
MV

Manish Vemula

Screened

Mid-level Machine Learning Engineer specializing in real-time pipelines and NLP/GenAI

TX, USA4y exp
DiscoverCentral Michigan University

“ML/MLOps practitioner from Discover Financial who built and deployed a real-time AI fraud detection platform (LSTM + VAE) on AWS SageMaker with Docker/FastAPI and Jenkins-driven CI/CD. Demonstrated measurable impact (30% accuracy lift, 25% fewer false alerts) and deep expertise in class-imbalance mitigation, drift monitoring, and orchestration (Airflow/Kubeflow), plus strong stakeholder adoption via Power BI dashboards for fraud/compliance teams.”

AgileAnomaly DetectionAPI IntegrationAWS LambdaAzure Machine LearningCI/CD+101
View profile
VV

Vikas Velagapudi

Screened

Mid-level Software Engineer specializing in Machine Learning and LLMs

Atlanta, USA4y exp
Pelican IT GroupGeorge Mason University

“Software engineer with robotics and ML background (BS Software Engineering w/ Robotics minor; MS CS w/ ML minor) who built autonomy-focused student robotics projects combining RFID + camera sensing, path planning (Dijkstra), and fuzzy logic, and experimented with neural-network approaches. Also brings production-grade software practices from a Dell software analyst role, emphasizing maintainability, documentation, and testing for real-time systems.”

Backend DevelopmentCCSSData PreprocessingDebuggingFlask+64
View profile
SS

Shimil Shijo

Screened

Senior AI Software Engineer specializing in Generative AI and NLP

Dearborn, MI6y exp
University of Michigan-DearbornUniversity of Michigan-Dearborn

“Built and deployed a production multimodal language translation platform (text-to-text, speech-to-text, text-to-speech) using fine-tuned pretrained models (NLLB, XLSR), MLflow-orchestrated pipelines, and Docker/Kubernetes on AWS. Worked closely with non-technical linguists to tackle data cleaning and dialect variation in minority languages, improving accuracy through consistent evaluation and monitoring.”

PythonCC++RJavaNumPy+79
View profile
DD

Dhairya Desai

Screened

Senior AI/ML Engineer specializing in healthcare NLP and predictive analytics

Chicago, IL13y exp
OptumUniversity of Texas at Dallas

“ML/NLP engineer with healthcare and industrial IoT experience: built an Optum pipeline that converted 2M+ physician notes into structured entities and linked them with claims/pharmacy data to create an actionable patient timeline. Deep hands-on expertise in production NER, entity resolution, and hybrid search (Elasticsearch + embeddings/FAISS), plus robust data engineering practices (Airflow, Spark, data contracts, auditability) and experimentation-to-production rollout via shadow mode and feature flags.”

PythonRSQLMATLABCC#+157
View profile
PS

Ponugoti Sushma

Screened

Mid-level Machine Learning Engineer specializing in IoT, edge AI, and enterprise ML

Texas, USA5y exp
AllstateTexas A&M University-Corpus Christi

“Built and productionized an LLM/RAG question-answering service over technical documentation, focusing on retrieval quality (reranking + IR metrics), latency, and scaling. Experienced orchestrating end-to-end ETL/ML workflows with Airflow/Prefect/AWS Step Functions and improving reliability via parallelism, retries, and shadow testing. Also delivered an explainable healthcare risk-flagging classifier with a stakeholder-friendly dashboard for a non-technical program manager.”

PythonCC++TensorFlowPyTorchScikit-learn+134
View profile
TW

Tejaswini Waghmare

Screened

Senior Data Analytics & Data Science professional specializing in Financial Services

4y exp
InfosysGeorgia State University

“Worked on large financial analytics datasets combining complaint text, transaction logs, and demographics; built end-to-end NLP/ML pipelines (TF-IDF + Random Forest) and data integration in BigQuery with Tableau reporting, citing ~95–98% accuracy. Also implemented entity resolution with fuzzy matching and semantic linking using BERT sentence-transformer embeddings stored in FAISS, including fine-tuning on labeled pairs to improve search/linking relevance.”

SQLXMLMySQLPythonRBigQuery+109
View profile
AG

Amie Gibson

Screened

Senior Geospatial Developer specializing in GIS automation, elevation/LiDAR, and AI-enabled apps

Sand Springs, OK27y exp
FEMAFlorida Institute of Technology

“Built and monetized an object-identification app end-to-end (FastAPI backend, HTML/JS frontend, SQLite→Postgres, auth, and an iOS wrapper via Capacitor/Xcode with Apple privacy/policy compliance). Also productionized an AI-native geospatial metadata/QA assistant using LLM+RAG plus deterministic Python validation, measuring impact via time-to-first-pass review and rework rate, and has experience modernizing legacy GIS workflows and delivering across USDA/FEMA-style teams with disciplined Jira-based execution.”

AgileAPI IntegrationAWSBashC#C+++111
View profile
AA

Amogh Arya Munipalle

Screened

Junior Software Engineer specializing in cloud, DevOps, and applied AI security

West Lafayette, Indiana3y exp
Freight PinsPurdue University

“Founding engineer who built a multi-tenant AWS backend from scratch focused on ultra-fast, configuration-driven client onboarding and low operational cost. Automated tenant provisioning/deployments with Terraform + GitHub Actions (new client infra in ~13 minutes) and scaled to 62 production clients handling ~75k requests/day without a major rewrite. Hands-on with migrations (DynamoDB->MongoDB), reliability/observability, and performance tuning (indexes, Redis, queueing, connection management).”

API DevelopmentAuthenticationAWSAWS IAMAWS LambdaAWS Step Functions+145
View profile
ST

Sravya Thotakuri

Screened

Mid-level Full-Stack Developer specializing in Healthcare and FinTech web applications

Remote, USA4y exp
Fairview Health ServicesUniversity of Dayton

“Hands-on engineer focused on productionizing LLM-powered assistants: builds RAG pipelines with guardrails, response schemas, and citation-grounded outputs, then hardens them with explicit NFRs (latency, uptime, security, cost). Experienced diagnosing agentic/LLM workflow issues in real time using observability and stepwise isolation, and supports go-to-market via developer demos, workshops, and pre-sales technical evaluations in microservices/Spring Boot environments.”

ReactAngularSpring BootJavaJavaScriptNode.js+121
View profile
BS

Bharath Simha Reddy Kothapeta

Screened

Full-Stack Software Engineer specializing in Java, React, and AWS

Plano, TX3y exp
Progress SolutionsNorthwest Missouri State University

“Backend-focused Python engineer who builds modular Flask services on AWS and specializes in performance/scalability work across data-heavy APIs. Has concrete wins in query optimization (1.5s to <200ms) and high-throughput async processing (Celery+Redis, ~40% throughput gain), plus experience serving scikit-learn text classification models via containerized REST services and designing multi-tenant data isolation strategies.”

AgileAmazon CloudWatchAmazon EC2Amazon ECSAmazon RedshiftAmazon S3+117
View profile
TS

Tanmay Sharma

Screened

Mid-level Backend Software Engineer specializing in microservices and AI/ML

Chandigarh, India3y exp
Excellence EducationUniversity at Buffalo

“JavaScript engineer with open-source experience on a database visualization library, focused on real-time rendering performance for large datasets (virtualized DOM rendering, requestAnimationFrame/debouncing, memoization) and on raising project quality via tests and CI performance benchmarks. Also built Kafka-based messaging documentation and sample producer/consumer apps to speed onboarding, and has experience diagnosing production issues including concurrency-related duplicate data problems.”

AgileAmazon S3Apache KafkaAPI DevelopmentAWSAWS Lambda+99
View profile
KG

Krithika GandlurMurali

Screened

Mid-Level Forward Deployed AI Engineer specializing in RAG systems and backend microservices

Austin, TX4y exp
SequretekStevens Institute of Technology

“LLM solutions practitioner with SOC/alert-triage experience who takes LLM prototypes to production using RAG (Pinecone), FastAPI services, guardrails, CI/CD, monitoring, and robust fallback logic. Known for rapid real-time debugging of embedding/vector and agent workflow issues, and for driving adoption through code-first workshops and sales-aligned custom demos with measurable improvements (35% faster triage; 40% increase in correct tool usage).”

PythonFastAPIRetrieval-Augmented Generation (RAG)Prompt engineeringOpenAI APIEmbeddings+85
View profile
1...222324...37

Related

Machine Learning EngineersSoftware EngineersData ScientistsResearch AssistantsAI EngineersSoftware DevelopersAI & Machine LearningEngineeringData & AnalyticsEducation

Need someone specific?

AI Search