Publications

A selection of publications/presentations from the Center for Digital Research and Scholarship (CDRS)

We investigate how computational methods can transform digital collections into structured evidence for scholarship and institutional analysis.

2025

Evaluating the Impact of Automated Labeling on Retrieval Instability in Neural IR

William A. Ingram. 2025. “Evaluating the Impact of Automated Labeling on Retrieval Instability in Neural IR.” In Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’25), Padua, Italy, p. 4209. Doctoral Consortium Paper. 10.1145/3726302.3730128.

2024

Building datasets to support information extraction and structure parsing from electronic theses and dissertations

William A. Ingram, Jian Wu, Sampanna Yashwant Kahu, Javaid Akbar Manzoor, Bipasha Banerjee, Aman Ahuja, Muntabir Hasan Choudhury, Lamia Salsabil, Winston Shields, and Edward A. Fox. 2024. “Building datasets to support information extraction and structure parsing from electronic theses and dissertations.” International Journal on Digital Libraries, Vol. 25 (2), pp. 175–196. 10.1007/s00799-024-00395-4.

ETDPC: A Multimodality Framework for Classifying Pages in Electronic Theses and Dissertations

Muntabir Hasan Choudhury, Lamia Salsabil, William A. Ingram, Edward A. Fox, and Jian Wu. 2024. “ETDPC: A Multimodality Framework for Classifying Pages in Electronic Theses and Dissertations.” In Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI 2024), Vancouver, Canada, pp. 22878–22884. 10.1609/AAAI.V38I21.30324.

Automating Chapter-Level Classification for Electronic Theses and Dissertations

Bipasha Banerjee, William A. Ingram, and Edward A. Fox. 2024. “Automating Chapter-Level Classification for Electronic Theses and Dissertations.” In 2024 IEEE International Conference on Big Data (BigData ’24), Washington, DC, USA, pp. 2400–2409. As part of the 7th Computational Archival Science (CAS) Workshop. 10.1109/BigData62323.2024.10825418.

Nuclear Pore Segmentation in 3D FIB-SEM Images with Dynamic Cyclical Data Augmentation

Chongyu He, Zhiwu Xie, Yinlin Chen, and Edward A. Fox. 2024. “Nuclear Pore Segmentation in 3D FIB-SEM Images with Dynamic Cyclical Data Augmentation.” In Proceedings of the IEEE International Conference on Big Data (BigData 2024), pp. 1972–1977. 10.1109/BigData62323.2024.10825445.

Searching for studies: A guide to information retrieval for Campbell systematic reviews

Heather MacDonald, Cozette Comer, Margaret Foster, Patrick R. Labelle, Scott Marsalis, Kate Nyhan, Zahra Premji, Morwenna Rogers, Ryan Splenda, Claire Stansfield, and Sarah Young. 2024. “Searching for studies: A guide to information retrieval for Campbell systematic reviews.” Campbell Systematic Reviews, first published 10 September 2024. 10.1002/cl2.1433.

Constructing Consistent Comprehensive Searches in Large Engineering Databases—Tips and Recommendations for Literature Reviews

Sarah Over and C. Cozette Comer. 2024. “Constructing Consistent Comprehensive Searches in Large Engineering Databases—Tips and Recommendations for Literature Reviews.” Proceedings of the American Society for Engineering Education (ASEE) 2024 Annual Conference & Exposition, June 23, 2024. 10.18260/1-2--47068.

Digitizing Metadata of a University Fashion Collection’s Holdings Using OCR and Costume Core

Dina Smith-Glaviana, Wen Nie Ng, Chreston Miller, and Julia Spencer. 2024. “Digitizing Metadata of a University Fashion Collection’s Holdings Using OCR and Costume Core.” Journal of Library Metadata, Vol. 24 (2), pp. 57–86. 10.1080/19386389.2024.2303849.

Automatic Expansion of Metadata Standards for Historic Costume Collections

Caleb McIrvin, Chreston Miller, Dina Smith-Glaviana, and Wen Nie Ng. 2024. “Automatic Expansion of Metadata Standards for Historic Costume Collections.” Journal of eScience Librarianship, Vol. 13 (1), e845. 10.7191/jeslib.845.

2023

Integrated Digital Library System for Long Documents and their Elements

Satvik Chekuri, Prashant Chandrasekar, Bipasha Banerjee, Sung Hee Park, Nila Masrourisaadat, Aman Ahuja, William A. Ingram, and Edward A. Fox. 2023. “Integrated Digital Library System for Long Documents and their Elements.” In Proceedings of the 2023 ACM/IEEE Joint Conference on Digital Libraries (JCDL ’23), Santa Fe, New Mexico, USA, pp. 13–24. Nominated for Best Student Paper Award. 10.1109/JCDL57899.2023.00012.

A New Annotation Method and Dataset for Layout Analysis of Long Documents

Aman Ahuja, Kevin Dinh, Brian Dinh, William A. Ingram, and Edward Fox. 2023. “A New Annotation Method and Dataset for Layout Analysis of Long Documents.” In Companion Proceedings of the ACM Web Conference 2023 (WWW ’23 Companion), Austin, TX, USA, pp. 834–842. As part of the 3rd International Workshop on Scientific Knowledge Representation, Discovery, and Assessment (Sci-K 2023). 10.1145/3543873.3587609.

2022

DevOps Practices in Digital Library Development

Yinlin Chen. 2022. “DevOps Practices in Digital Library Development.” Proceedings of the 22nd ACM/IEEE Joint Conference on Digital Libraries (JCDL ’22), Cologne, Germany, Article No. 38, pp. 1–4. 10.1145/3529372.3533284.