Publications

publications by categories in reversed chronological order.

2025

  1. WAT, IJCNLP-AACL
    wat_vision_judge.png
    A Picture is Worth a Thousand (Correct) Captions: A Vision-Guided Judge-Corrector System for Multimodal Machine Translation
    Siddharth Betala, Kushan Raj, Vipul Betala, and 1 more author
    In The 12th Workshop on Asian Translation (WAT) at IJCNLP-AACL, 2025
  2. NeurIPS AI4Mat
    lemat_genbench.png
    LeMat-GenBench: A Unified Evaluation Framework for Crystal Generative Models
    Siddharth Betala, Samuel P. Gleason, Ali Ramlaoui, and 12 more authors
    In AI for Accelerated Materials Design (AI4Mat) Workshop, NeurIPS, 2025
    Workshop paper, to be submitted to Nature Computational Science
  3. NeurIPS AI4Mat
    lemat_synth.png
    LeMat-Synth: a multi-modal toolbox to curate broad synthesis procedure databases from scientific literature
    Magdalena Lederbauer, Siddharth Betala, Xiyao Li, and 16 more authors
    In AI for Accelerated Materials Design (AI4Mat) Workshop, NeurIPS, 2025
    Workshop paper, to be submitted to Digital Discovery

2024

  1. NeurIPS WiML
    ood-gnn.png
    Out-of-Distribution performance as a proxy metric for graph neural network explainers in the absence of ground-truth explanations
    Siddharth Betala, Guadalupe Gonzalez, and Chirag Agarwal
    In Women in Machine Learning (WiML) Workshop, NeurIPS, 2024
    Workshop poster, presented by Guadalupe Gonzalez
  2. WMT, EMNLP
    llm_captioning.png
    Brotherhood at WMT 2024: Leveraging LLM-Generated Contextual Conversations for Cross-Lingual Image Captioning
    Siddharth Betala and Ishan Chokshi
    In Ninth Conference on Machine Translation (WMT) at EMNLP, 2024
  3. EMNLP Main
    deidentification.png
    De-Identification of Sensitive Personal Data in Datasets Derived from IIT-CDIP
    Stefan Larson, Nicole Cornehl Lima, Santiago Pedroza Diaz, and 9 more authors
    In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
  4. MLCB
    protein_diffusion.png
    Screening Protein Sequences Generated via Conditional Diffusion for Enhanced Fitness using a GNN-based Function Predictor
    Siddharth Betala, Zhiqing Xu, Rana Ahmed Barghout, and 2 more authors
    In Machine Learning for Computational Biology (MLCB), 2024
    Poster presentation

2023

  1. COBIOT
    enzyme_design.png
    Advances in generative modeling methods and datasets to design novel enzymes for renewable chemicals and fuels
    Rana A. Barghout, Zhiqing Xu, Siddharth Betala, and 1 more author
    Current Opinion in Biotechnology, 2023