Data Scientist & AI Researcher
/usr/bin/git | /usr/local/huggingface | /var/log/professional | /usr/share/scholar | /var/www/blog
/opt/kaggle | /usr/bin/leetcode | /var/log/social | /usr/share/videos | /var/www/medium
I'm an interdisciplinary AI professional with years of experience building production-grade LLM architectures, search, retrieval systems and end to end product development . Mathematics and Computer Science is my foundation. Conversational AI, Search, and exploring seemingly unrelated yet connected concepts are my playgrounds.
I have a strong background in mathematics and computer science with a dual degree from BITS Pilani. My interdisciplinary approach allows me to tackle problems from multiple angles, drawing insights from graph theory, topology, functional analysis, and computational methods.
My research focuses on developing novel interpretability metrics for LLMs to enhance transparency in decision-making.
All research areas align towards building scalable, interpretable, and production-grade LLM systems.
Workato (Bengaluru, India)
Jio Platforms Ltd (Hyderabad, India)
Amelia / IPSoft (Bengaluru, India)
Video Analytics Lab, IISc Bengaluru
BITS Pilani Goa Campus
Research under Dr. Jajati Keshari Sahoo exploring novel applications of game theory in ensemble learning:
Research under Dr. Tirtharaj Dash developing a novel reverse image search system:
Dual Degree: M.Sc Mathematics & B.E Computer Science
GPA: 7.47/10 (Equivalent to 3.7/4.0 US Scale)
Relevant Courses: Functional Analysis, Data Structures and Algorithms, Advanced Probability and Statistics, Graph Theory, Topology, Operations Research, Machine Learning, Quantum Computing
Image captioning-based image search engine: An alternative to retrieval by metadata
S. Iyer, S. Chaturvedi, T. Dash - Soft Computing for Problem Solving: SocProS 2017, Volume 2, 181-191
View on Google ScholarAI Assistant built on fine-tuned Mistral 7B with 8000-token context handling for enhanced understanding of complex conversations.
View DeydooGPT on GitHubMedical diagnosis system using Graph RAG techniques, achieving 85.83% production accuracy for clinical applications.
Proprietary ProjectHigh-performance inference system that reduced latency by 50% using ZeroMQ & NGINX for efficient handling of NLP requests.
View on GitHubA reverse image search engine powered by Elasticsearch and TensorFlow for advanced image recognition and retrieval.
View on GitHubCore technical member for this explainable AutoML platform for tabular data that automates model selection, hyperparameter tuning, and provides interpretable results.
Proprietary ProjectEducational platform with video content and intelligent recommendation system to personalize learning experiences.
View on GitHubRSS feed recommender using hybrid approach combining lexical and BERT-based content analysis for personalized news delivery.
View on GitHubMATLAB solver for Fractional Differential Equations with applications in mathematical modeling and simulations.
View on GitHubSystem that uses BERT and mathematical distance metrics to match concepts without prior training, boosting accuracy by 35%.
Proprietary ProjectA production-ready Python tool that enables natural conversations with YouTube videos using ASR, NLP, and LLMs. Features include automatic transcription, context-aware responses, and efficient caching.
View on CodebergA comprehensive 9-10 month roadmap transforming beginners into job-ready IT professionals. Features structured learning paths, hands-on projects, and industry-aligned curriculum covering CS fundamentals to advanced AI.
View on CodebergA production-grade RAG pipeline using txtai for semantic search and LLM orchestration. Features include dynamic index management, query classification, and lightning-fast retrievals (3s search, <2s retrieval). Includes real-world examples with multilingual support.
View on CodebergA chatbot application built using Python and AIML (Artificial Intelligence Markup Language) for natural language processing and pattern matching.
View Simple Chatbot on GitHubInteractive visualization of the Game of Thrones dataset using t-SNE dimensionality reduction, featured in a YouTube video for its innovative approach to data visualization.
View Game of Thrones Visualization on GitHubA novel optimization algorithm that uses prime number-based grid sampling to avoid aliasing problems common in regular grid search methods. Features dynamic resolution adaptation and domain shrinking around promising regions, with primes serving as resolution knobs for accuracy control.
View on GitHubA creative experimental habit tracker leveraging quantum physics, topology, and data analysis principles. Features include probabilistic habit representation, quantum machine learning for pattern analysis, behavioral topography visualization, and adaptive learning based on habit topology.
View on GitHubImplementation of list coloring problem using Grover's algorithm through SAT formulation, developed during IIT Delhi's Quantum Machine Learning course. Demonstrates quantum computing applications in graph theory and constraint satisfaction problems.
View on GitHubA cognitive architecture for LLMs featuring surprise-driven memory formation, Legendre polynomial-based orthogonal paragraph embeddings, and dynamic memory refinement. Implements temporal decay, access boosts, and contradiction detection for biologically-inspired memory management.
View on GitHubA research paper introducing an innovative approach to image search using captioning techniques instead of traditional metadata, published in SocProS 2017.
Authors: S. Iyer, S. Chaturvedi, T. Dash
Publication: Soft Computing for Problem Solving: SocProS 2017, Volume 2, 181-191
View on Google ScholarOngoing research in developing novel interpretability metrics for large language models to enhance transparency in decision-making and improve model trustworthiness.
Status: In progress
Research exploring innovative approaches to enhance long-text coherence in large language models through the integration of episodic memory mechanisms.
Status: In progress
Email: sethuiyer95@gmail.com