- Built end-to-end multi-omics analysis pipelines across bulk RNA-seq, scRNA-seq, GWAS and proteomic datasets — including Seurat batch integration, UMAP visualization, differential expression analysis and functional annotation across disease indications.
- Applied the Geneformer transformer-based foundation model (~30M single-cell transcriptomes) and Cytetype for AI-assisted automated cell type annotation and 256-dim UMAP-based classification.
- Delivered publication-quality visualizations (heatmaps, oncoplots, bar/box/scatter/density/PCA plots) using modular, reusable R and Python scripts.
- Designed and deployed a production-ready Streamlit application with OpenSearch API integration, ontology-based search with synonym support, adaptive metadata filtering, pagination and one-click dataset download.
- Developed automated data acquisition pipelines integrating Qiagen OmicsLand API, GEO and S3 bucket downloads with metadata schema extraction and preprocessing workflows.
- Worked on R Shiny applications for biomedical data workflows, focusing on session handling, user tracking and streamlined deployment using renv environments.
- Designed modular SQL workflows across PostgreSQL and MySQL, optimizing queries on large-scale databases via temporary tables, JOIN restructuring and data collapsing.
- Performed data scouting and curation across rare disease indications from public biomedical sources including NCBI GEO, ArrayExpress, PRIDE and metabolomics databases.
- Maintained reproducible, containerized analysis environments using Docker, renv and virtual environments with version-controlled pipelines.
- Developed R Shiny applications and data visualization projects in R and Python for biomedical data analysis in B2B SaaS products.
- Integrated Spotfire's built-in visualizations with custom visualizations built with the JSViz framework, Spotfire Mods, Plotly.js, R Shiny, and JavaScript-based Text Area scripts.
- Utilized Spotfire's data functions and programming capabilities in R and Python for data transformation.
- Wrote complex SQL for data ingestion using custom queries and information links.
- Built a collection of reusable IronPython scripts stored within DXP for streamlined development.
- Worked as a Spotfire developer, creating Spotfire dashboards for biological data using IronPython, R, SQL, Python, JavaScript, HTML, and CSS.
- Contributed to multiple high-visibility projects by developing R Markdown reports that supported data-driven business decisions.
- Worked on client-facing projects involving MongoDB, pipeline development, and Docker for efficient data processing and analysis.
- Developed pipelines and Docker containers to streamline deployment and ensure scalability across environments.
- Hands-on training in Docker, pandas, Python data analysis, R, and R Shiny.
- Developed a dashboard for easy querying, visualization, and report generation.
- Self-paced training through Udemy courses.
- Contributed to an in-house industrial project built with R, SQL, and the Shiny framework.
- The dashboard lets users search, visualise, audit, and generate reports on many facets of the datasets through a web-based GUI.
- Answered questions asked by students globally across biology and bioinformatics domains.
- Followed quality guidelines to answer questions with academic integrity.
- Ensured that the answering guidelines were strictly followed.
- Trained students for SAT, GRE, and TOEFL examinations.
- Advised students on education opportunities, application procedures, visa applications, and other documentation for study abroad.
- Liaised with students, other offices, and client institutions.
- Assisted with the general running of the office to ensure smooth operations.
A collection of generative-art animations created using Plotnine, exploring the intersection of data visualization and creative coding. This submission earned the Runner-Up position, standing out for its artistic interpretation of data.
An interactive table design focused on clarity, usability, and aesthetic presentation. This submission received an Honorable Mention, highlighting effective communication of structured data through modern table design techniques.
A talk on building dynamic and reproducible reports using Quarto, combining R and Python within a single document, enabling interactive exploration with Observable JS, and incorporating scrollytelling techniques with the CloseRead extension for more effective and transparent data communication.
A hands-on learning challenge focused on building AI-powered applications using Streamlit. This project documents daily explorations, covering practical implementations of machine learning, generative AI, and interactive app development.
This project presents meteorological data for various Indian states, featuring interactive tables and visualizations using ApexCharts and Toast UI, inspired by the aesthetics of Kung Fu Panda.
A showcase of three distinct entries in the Plotnine contest, highlighting the versatility of data visualization through artistic expressions and scientific analysis.
Embark on an intergalactic journey with our interactive Shiny dashboard! Explore key statistics of legendary characters, visualize the balance of the Force, and dive into the rich tapestry of Star Wars communities. This dashboard brings the epic saga to life through engaging data visualization.