Portfolio item number 1
Short description of portfolio item number 1
Short description of portfolio item number 1
Short description of portfolio item number 2 
Jin-Ting Ding, Hang-Yao Tu, Ze-Lin Zang, Min Huang, Sheng-Jun Zhou, Computers and electronics in agriculture, 2018
Zelin Zang, Wanliang Wang, Yuhang Song, Linyan Lu, Weikun Li, Yule Wang, Yanwei Zhao, Computational intelligence and neuroscience, 2019, First Author
Guoqi Chen, Wanliang Wang, Zheng Wang, Honghai Liu, Zelin Zang, Weikun Li, Applied Intelligence, 2020
Stan Z. Li, Zelin Zang, Lirong Wu, arXiv preprint arXiv:2006.08256, 2020, Co-first Author
Fangfei Zhang, Shaoyang Yu, Lirong Wu, Zelin Zang, Xiao Yi, Jiang Zhu, Cong Lu, Ping Sun, Yaoting Sun, Sathiyamoorthy Selvarajan, et al., Journal of the American Society for Mass Spectrometry, 2020
Di Wu, Siyuan Li, Zelin Zang, Kai Wang, Lei Shang, Baigui Sun, Hao Li, Stan Z. Li, arXiv preprint arXiv:2106.15788, 2021
Kai Zhou, Yaoting Sun, Lu Li, Zelin Zang, Jing Wang, Jun Li, Junbo Liang, Fangfei Zhang, Qiushi Zhang, Weigang Ge, et al., Computational and structural biotechnology journal, 2021, Co-first Author
Lirong Wu, Zicheng Liu, Jun Xia, Zelin Zang, Siyuan Li, Stan Z. Li, IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2022
Zelin Zang, Siyuan Li, Di Wu, Jianzhu Guo, Yongjie Xu, Stan Z. Li, Neurocomputing, 2022, CCF-B, First Author
Yaoting Sun, Sathiyamoorthy Selvarajan, Zelin Zang*, Wei Liu, Yi Zhu, Hao Zhang, Wanyuan Chen, Hao Chen, Lu Li, Xue Cai, et al., Cell Discovery, 2022, IF=38, Co-first Author
Zelin Zang, Siyuan Li, Di Wu, Ge Wang, Kai Wang, Lei Shang, Baigui Sun, Hao Li, Stan Z. Li, European Conference on Computer Vision (ECCV), 2022, CCF-A, First Author
Feiyang Guo, Linyan Lu, Zelin Zang, Mohammad Shikh-Bahaei, IEEE Open Journal of the Communications Society, 2023
Siyuan Li, Di Wu, Fang Wu, Zelin Zang, Stan Li, International Conference on Machine Learning (ICML23), 2023
Kailong Zhao, Yuhao Xia, Fujin Zhang, Zelin Zang, Stan Z. Li, Guijun Zhang, Communications biology, 2023
Yongjie Xu*, Zelin Zang*, et al., Communications Biology, 2023, IF=6.5, Co-first Author
Zelin Zang, Yongjie Xu, Linyan Lu, Yulan Geng, Senqiao Yang, Stan Z. Li, Neural Networks, 2023, CCF-B, First Author
Zelin Zang, Lei Shang, Senqiao Yang, Fei Wang, Baigui Sun, Xuansong Xie, Stan Z. Li, International Conference on Computer Vision (ICCV), 2023, CCF-A, Oral, First Author
Zelin Zang, Shenghui Cheng, Hanchen Xia, Liangyu Li, Yaoting Sun, Yongjie Xu, Lei Shang, Baigui Sun, Stan Z. Li, IEEE Transactions on Visualization and Computer Graphics (TVCG), 2024, CCF-A, First Author
Sitao Luan, Chenqing Hua, Qincheng Lu, Liheng Ma, Lirong Wu, Xinyu Wang, Minkai Xu, Xiao-Wen Chang, Zelin Zang, Doina Precup, et al., arXiv preprint arXiv:2407.09618, 2024
Kai Wang, Dongwen Tang, Boya Zeng, Yida Yin, Zhaopan Xu, Yukun Zhou, Zelin Zang, Trevor Darrell, Zhuang Liu, Yang You, arXiv preprint arXiv:2402.13144, 2024
Zelin Zang, Hao Luo, Kai Wang, Panpan Zhang, Fan Wang, Stan Li, Yang You, International Conference on Machine Learning (ICML), 2024, CCF-A, First Author
Chenrui Duan*, Zelin Zang*, et al., Neural Information Processing Systems (NeurIPS), 2024, CCF-A, Co-first Author
Zelin Zang, et al., Information Fusion, 2025, CCF-A, IF=15.6, First Author
Zelin Zang, et al., Briefings in Bioinformatics, 2025, CCF-B, AI4SCI, First Author
Zelin Zang, et al., IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2025, CCF-B, Corresponding Author
Zelin Zang, et al., IEEE ICASSP, 2025, CCF-B, Corresponding Author
Zelin Zang, et al., IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2026, CCF-A, IF=20, First Author
Zelin Zang, et al., Journal of Machine Learning Research (JMLR), 2026, CCF-A, First Author
Yaoting Sun*, Zelin Zang*, et al., Nature Communications, 2026, Top Journal, Co-first Author
Zelin Zang, et al., Nature Machine Intelligence (Under Review), Top Journal, First Author
Gaoyang Luo*, Zelin Zang*, et al., Nature Methods (Under Review), Top Journal, Co-first Author
Zelin Zang, et al., ICLR 2026 (Under Review), CCF-A, First Author, Scores: 6,6,10
Zelin Zang, et al., IEEE Transactions on Visualization and Computer Graphics (TVCG), 2026, CCF-A, Corresponding Author
Zelin Zang, et al., AAAI Conference on Artificial Intelligence (AAAI), 2026, CCF-A, Oral, Corresponding Author
Zelin Zang, et al., AAAI Conference on Artificial Intelligence (AAAI), 2026, CCF-A, Oral, First Author
Published:
Presented cutting-edge research on deep manifold learning methods for high-dimensional data visualization. This talk covered the DMTEV framework, which combines topology-preserving dimensionality reduction with explainable AI techniques. Discussed applications in single-cell genomics, medical imaging, and complex data analysis, demonstrating how deep manifold transformation enables interpretable visualization of complex biological and clinical datasets while preserving intrinsic data structures.
Published:
Comprehensive review presentation on AI for Science research progress, covering interdisciplinary applications of deep learning in biology, medicine, and physical sciences. Highlighted achievements in protein structure prediction, drug discovery, molecular dynamics simulation, and single-cell analysis. Discussed the integration of foundation models, geometric deep learning, and physics-informed neural networks for advancing scientific discovery. Presented future directions including AI-driven virtual cell construction and multi-omics data integration.
Published:
Detailed research presentation on novel diffusion-based data augmentation methods for unsupervised contrastive learning. Introduced the DiffAug framework that leverages diffusion models to generate semantically consistent augmented views from scratch, addressing the limitations of traditional augmentation techniques. Demonstrated superior performance on image representation learning tasks, showing significant improvements in downstream classification, detection, and segmentation tasks. This work was later published at ICML 2024.
Published:
Invited lecture at NUS HPC-AI Lab during visiting scholar period under Prof. Yang You’s supervision. Presented research on scalable deep learning systems, distributed training optimization, and efficient model parallelism. Covered advanced topics including gradient compression, mixed-precision training, and memory-efficient attention mechanisms. Discussed applications of HPC-AI in large-scale biological data analysis, particularly for single-cell genomics and protein structure prediction. Explored future directions in federated learning and edge computing for life sciences.
Published:
Comprehensive presentation on geometric deep learning applications in drug discovery and single-cell data analysis. Discussed graph neural networks for molecular property prediction, protein-ligand binding affinity estimation, and structure-based drug design. Presented novel approaches for learning on cellular graph representations, including cell-cell interaction networks and spatial transcriptomics data. Highlighted the integration of multi-modal biological data (genomics, proteomics, metabolomics) through geometric learning frameworks. Demonstrated state-of-the-art results on drug-target interaction prediction and cell type annotation tasks.
Published:
Presented collaborative research on AI-driven proteomics analysis for cancer classification and prognosis prediction. Introduced deep learning frameworks that integrate mass spectrometry data, clinical information, and multi-omics profiles for personalized cancer medicine. Discussed novel attention mechanisms for identifying prognostic protein biomarkers and pathway-level features. Demonstrated applications in thyroid cancer, medullary carcinoma, and other malignancies. Covered the development of interpretable models that provide biological insights into cancer mechanisms while achieving superior predictive performance compared to traditional statistical methods.
Published:
Academic job interview presentation covering comprehensive research portfolio in AI for biological sciences and medical applications. Presented major achievements including deep manifold learning for high-dimensional data visualization, foundation models for single-cell analysis, and AI-driven drug discovery platforms. Discussed future research directions including virtual cell construction, interpretable AI for medicine, and multi-modal biological data integration. Outlined collaborative opportunities with clinical and biological research groups, and proposed innovative approaches for translating AI research into clinical practice.
Published:
Poster presentation at ICML 2024 showcasing the DiffAug framework for unsupervised contrastive learning. Demonstrated how diffusion models can generate high-quality augmented views that preserve semantic content while introducing beneficial variations. Engaged with leading researchers in self-supervised learning, discussing the theoretical foundations of diffusion-based augmentation and its advantages over traditional geometric and color transformations. Received positive feedback on the method’s generalizability across different domains including natural images, medical imaging, and scientific visualization. The work addresses fundamental challenges in learning robust visual representations without labeled data.
Published:
Internal technical discussion session focusing on challenges and best practices in training large-scale foundation models. Covered topics including distributed training strategies, gradient accumulation techniques, memory optimization, and hyperparameter tuning. Discussed infrastructure requirements for training billion-parameter models, including GPU cluster configuration, data pipeline optimization, and checkpoint management. Explored emerging techniques such as LoRA, quantization-aware training, and efficient attention mechanisms for reducing computational costs while maintaining model performance.
Published:
Presented innovative research on phylogenetic tree construction using deep learning approaches. Introduced the BioTree framework that combines language models, graph neural networks, and evolutionary algorithms for inferring phylogenetic relationships from molecular sequences. Discussed the MDTree (Masked Dynamic Autoregressive Model) for phylogenetic inference and its advantages over traditional maximum likelihood and Bayesian methods. Covered applications in viral evolution tracking, bacterial taxonomy, and cancer phylogenetics. Demonstrated significant improvements in accuracy and computational efficiency for large-scale phylogenetic studies.
Published:
Research preview presentation on building foundation models for single-cell analysis and constructing comprehensive cell knowledge bases. Introduced the concept of cell-level language models trained on millions of single-cell transcriptomic profiles. Discussed pre-training strategies, including masked cell prediction, contrastive learning on cell embeddings, and multi-task learning for cell type annotation, trajectory inference, and perturbation prediction. Covered the development of a unified cell atlas integrating data from multiple tissues, species, and technologies. Explored applications in automated cell annotation, novel cell type discovery, and understanding cellular states in disease.
Published:
Comprehensive overview of recent advances in Retrieval-Augmented Generation (RAG) systems, with emphasis on scientific and medical applications. Discussed state-of-the-art retrieval methods including dense retrievers, hybrid search, and learned sparse representations. Covered generation models fine-tuned for domain-specific knowledge synthesis. Presented case studies on applying RAG to medical diagnosis support, drug discovery literature mining, and scientific paper summarization. Explored challenges including retrieval quality, hallucination mitigation, and factual consistency. Demonstrated practical implementations using vector databases and efficient retrieval architectures for real-time knowledge-intensive tasks.
Published:
Forum presentation on advanced deep learning techniques for analyzing and visualizing high-dimensional scientific data. Covered dimensionality reduction methods including deep manifold learning, variational autoencoders, and diffusion-based approaches. Discussed challenges in preserving local and global structure, handling non-linear manifolds, and maintaining interpretability. Presented applications across diverse domains: single-cell genomics (tens of thousands of genes), medical imaging (volumetric data), and materials science (molecular dynamics). Demonstrated how topology-aware neural networks can reveal hidden patterns in complex datasets while providing biologically or physically meaningful representations.
Published:
Detailed presentation on automated cell type annotation and cell matching algorithms for single-cell transcriptomics. Introduced novel methods for transferring cell type labels across datasets using optimal transport, contrastive learning, and attention mechanisms. Discussed challenges in batch effect correction, rare cell type identification, and handling technical variability across sequencing platforms. Demonstrated the CellScout framework for interactive cell annotation with visual analytics support. Covered applications in atlas-scale cell annotation, cross-species cell mapping, and identifying disease-associated cell states through comparative analysis.
Published:
Invited lecture on the latest advances in medical foundation models and their applications in clinical practice. Presented the MedLA (Medical Logic-driven Agent) framework for interpretable diagnostic reasoning using large language models. Discussed multi-agent systems that integrate medical knowledge graphs, clinical guidelines, and patient data for comprehensive diagnosis support. Covered challenges in medical AI including reliability, explainability, and handling rare diseases. Demonstrated real-world case studies showing improved diagnostic accuracy and reduced physician cognitive load. Explored regulatory considerations and pathways for clinical deployment of AI diagnostic assistants.
Published:
Doctoral dissertation defense presenting comprehensive research on deep manifold learning methods for high-dimensional data analysis. The dissertation covered: (1) Theoretical foundations of topology-preserving dimensionality reduction using deep neural networks; (2) Novel architectures including DLME, DMTEV, and DMT-ME for explainable visualization; (3) Applications in single-cell genomics, medical imaging, and drug discovery; (4) Benchmarking studies demonstrating superior performance over classical methods. Committee members included leading experts in AI, computational biology, and medical informatics. Successfully defended with distinction, receiving recognition for significant contributions to both methodology and real-world applications.
Published:
Keynote address at the opening ceremony of national symposium on multi-modal biological data integration. Outlined the landscape of multi-modal learning challenges in modern biology, including integrating genomics, transcriptomics, proteomics, metabolomics, and imaging data. Discussed emerging architectures such as cross-modal transformers, multi-modal graph neural networks, and modality-specific encoders with shared latent spaces. Presented case studies on drug-cell interaction modeling, spatial transcriptomics analysis, and multi-omics disease prediction. Emphasized the need for standardized benchmarks, interpretability frameworks, and collaborative data sharing initiatives to advance the field.
Published:
Progress report on integrating single-cell proteomics with transcriptomics for comprehensive cellular profiling. Presented advances in CITE-seq data analysis, surface protein marker identification, and joint embedding of protein and RNA measurements. Discussed computational methods for handling the unique characteristics of protein data including sparsity, dynamic range, and antibody-specific noise. Demonstrated applications in immune cell profiling, cancer heterogeneity analysis, and identifying protein-level dysregulation not captured by transcriptomics. Covered emerging technologies like spatial proteomics and multi-parameter flow cytometry analysis using deep learning.
Published:
Workshop presentation on applying artificial intelligence to precision medicine and clinical decision-making. Covered the complete pipeline from multi-modal health data collection, quality control, and integration to predictive modeling and clinical deployment. Presented collaborative work with SANY Group on industrial health monitoring and with medical institutions on diagnostic AI systems. Discussed privacy-preserving machine learning techniques, federated learning for multi-center studies, and regulatory frameworks for medical AI. Demonstrated successful case studies including early disease detection, treatment response prediction, and personalized therapy recommendation systems.
Published:
Project proposal for developing AI-powered virtual cell models that simulate cellular behavior at unprecedented scale and resolution. Proposed integrating single-cell multi-omics data, protein interaction networks, metabolic pathways, and spatial organization into unified computational framework. Outlined three-phase approach: (1) Foundation model pre-training on billion-scale cellular datasets; (2) Physics-informed neural networks for modeling cellular dynamics; (3) Generative models for predicting cellular responses to perturbations. Discussed applications in drug discovery, synthetic biology, and understanding disease mechanisms. Outlined collaboration opportunities with experimental biologists and computational resources required for training and validation.
Published:
Comprehensive seminar on medical AI technologies and their translation to clinical practice. Presented work on foundation models for medical diagnosis, including the MedLA framework for interpretable clinical reasoning. Covered natural language processing for electronic health records, computer vision for medical imaging, and knowledge graph construction for clinical decision support. Discussed challenges in model validation, handling distribution shift between institutions, and ensuring patient safety. Demonstrated collaborative projects with Ant Group on developing scalable medical AI platforms, including diagnostic chatbots, automated report generation, and real-time clinical alert systems. Addressed ethical considerations and regulatory compliance for medical AI deployment.
Published:
Mid-term progress report for the Rootcloud AI Agentic System R&D project (¥400,000 funding). Presented achievements in developing end-to-end agentic AI system with integrated RAG, reinforcement learning, and multimodal understanding. Demonstrated: (1) Agentic RAG prototype achieving 60%+ retrieval recall and answer accuracy; (2) RL-optimized agent models with 5%+ improvement over baselines; (3) Multimodal context management reducing token usage by 50% while maintaining 80%+ accuracy. Showcased industrial applications in autonomous fault diagnosis, dynamic tool invocation, and multi-turn reasoning. Outlined remaining work including production deployment, scalability testing, and integration with enterprise systems.
Published:
Seminar on applying deep learning to enhance coverage and resolution in spatial transcriptomics data. Presented the MuST (Multimodal Structure Transformation) framework for unified spatial transcriptomics analysis. Discussed methods for imputing missing gene expression, super-resolution reconstruction of spatial maps, and integrating imaging with sequencing data. Covered attention-based architectures that learn spatial dependencies and cell-cell communication patterns. Demonstrated applications in tumor microenvironment analysis, developmental biology, and neuroscience. Showed how physics-inspired priors and biological constraints can improve model generalization and interpretability in spatial omics analysis.
Published:
Advanced research presentation on leveraging mass spectrometry proteomics data for cancer classification and prognosis. Presented deep learning frameworks that process raw DIA (Data-Independent Acquisition) mass spectrometry data for end-to-end prediction. Discussed novel architectures for handling peptide-level features, protein-protein interaction networks, and pathway enrichment simultaneously. Demonstrated superior performance in thyroid cancer subtyping, medullary carcinoma prognosis, and pan-cancer biomarker discovery. Covered collaborative work with clinical institutions validating AI models on prospective cohorts. Explored the potential of integrating proteomics with genomics and clinical data for personalized cancer treatment strategies.
Published:
Detailed technical proposal for implementing virtual cell foundation models at BGI. Outlined comprehensive architecture including: (1) Cell-level pre-training on 100M+ single-cell profiles from BGI’s biobank; (2) Multimodal integration of transcriptomics, proteomics, and morphology; (3) Hierarchical modeling of cell states, cell types, and developmental trajectories; (4) Causal inference modules for perturbation prediction. Discussed computational infrastructure requirements, data preprocessing pipelines, and quality control protocols. Proposed validation framework using experimental perturbation studies. Defined collaboration model between AI team, wet lab scientists, and clinical researchers for iterative model refinement and real-world testing.
Published:
Strategic planning session on multi-modal approaches to studying molecular dynamics and cellular behavior. Discussed integration of multiple data modalities including: structural biology (X-ray, cryo-EM), molecular dynamics simulations, experimental measurements (NMR, spectroscopy), and high-throughput screening data. Outlined research directions for developing unified frameworks that bridge time scales from femtoseconds (bond vibrations) to hours (cellular processes). Proposed combining physics-based molecular mechanics with data-driven deep learning models. Defined collaboration strategy with experimental groups, computational resource allocation, and publication roadmap. Emphasized applications in understanding drug-target interactions, protein conformational changes, and allosteric regulation mechanisms.
Graduate course, Zhejiang University / Westlake University, 2021
Served as Teaching Assistant for Prof. Stan Z. Li’s Deep Learning course during Ph.D. studies. This graduate-level course covered fundamental and advanced topics in deep learning, including neural network architectures, optimization methods, convolutional neural networks, recurrent neural networks, attention mechanisms, and generative models.
Graduate course, Zhejiang University / Westlake University, 2022
Continued as Teaching Assistant for Prof. Stan Z. Li’s Deep Learning course in the second semester. Building on the experience from the previous year, enhanced the course with updated content on recent advances and more challenging project assignments.