Publications
A selection of publications/presentations from the Center for Digital Research and Scholarship (CDRS)
We investigate how computational methods can transform digital collections into structured evidence for scholarship and institutional analysis.
Evaluating the Impact of Automated Labeling on Retrieval Instability in Neural IR
William A. Ingram. 2025. “Evaluating the Impact of Automated Labeling on Retrieval Instability in Neural IR.” In Proceedings of the 48th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR ’25), Padua, Italy, pp. 4209. Doctoral Consortium Paper. 10.1145/3726302.3730128
Building datasets to support information extraction and structure parsing from electronic theses and dissertations
William A. Ingram, Jian Wu, Sampanna Yashwant Kahu, Javaid Akbar Manzoor, Bipasha Banerjee, Aman Ahuja, Muntabir Hasan Choudhury, Lamia Salsabil, Winston Shields, and Edward A. Fox. 2024. “Building datasets to support information extraction and structure parsing from electronic theses and dissertations.” International Journal on Digital Libraries, Vol. 25 (2), pp. 175–196. 10.1007/s00799-024-00395-4
ETDPC: A Multimodality Framework for Classifying Pages in Electronic Theses and Dissertations
Muntabir Hasan Choudhury, Lamia Salsabil, William A. Ingram, Edward A. Fox, and Jian Wu. 2024. “ETDPC: A Multimodality Framework for Classifying Pages in Electronic Theses and Dissertations.” In Thirty-Eighth AAAI Conference on Artificial Intelligence (AAAI 2024), Vancouver, Canada, pp. 22878–22884. 10.1609/AAAI.V38I21.30324
Automating Chapter-Level Classification for Electronic Theses and Dissertations
Bipasha Banerjee, William A. Ingram, and Edward A. Fox. 2024. “Automating Chapter-Level Classification for Electronic Theses and Dissertations.” In 2024 IEEE International Conference on Big Data (BigData ’24), Washington, DC, USA, pp. 2400–2409. As part of The 7th Computational Archival Science (CAS) Workshop. 10.1109/BigData62323.2024.10825418
Nuclear Pore Segmentation in 3D FIB-SEM Images with Dynamic Cyclical Data Augmentation
Chongyu He, Zhiwu Xie, Yinlin Chen, and Edward A. Fox. 2024. “Nuclear Pore Segmentation in 3D FIB-SEM Images with Dynamic Cyclical Data Augmentation.” In Proceedings of the IEEE International Conference on Big Data (BigData 2024), pp. 1972–1977. 10.1109/BigData62323.2024.10825445.
Searching for studies: A guide to information retrieval for Campbell systematic reviews
Heather MacDonald, Cozette Comer, Margaret Foster, Patrick R. Labelle, Scott Marsalis, Kate Nyhan, Zahra Premji, Morwenna Rogers, Ryan Splenda, Claire Stansfield, and Sarah Young. 2024. “Searching for studies: A guide to information retrieval for Campbell systematic reviews.” Campbell Systematic Reviews, first published 10 September 2024. 10.1002/cl2.1433.
Constructing Consistent Comprehensive Searches in Large Engineering Databases—Tips and Recommendations for Literature Reviews
Sarah Over and C. Cozette Comer. 2024. “Constructing Consistent Comprehensive Searches in Large Engineering Databases—Tips and Recommendations for Literature Reviews.” Proceedings of the American Society for Engineering Education (ASEE) 2024 Annual Conference & Exposition, June 23, 2024. 10.18260/1-2--47068.
Digitizing Metadata of a University Fashion Collection’s Holdings Using OCR and Costume Core
Dina Smith-Glaviana, Wen Nie Ng, Chreston Miller, and Julia Spencer. 2024. “Digitizing Metadata of a University Fashion Collection’s Holdings Using OCR and Costume Core.” Journal of Library Metadata, Vol. 24 (2), pp. 57–86. 10.1080/19386389.2024.2303849.
Automatic Expansion of Metadata Standards for Historic Costume Collections
Caleb McIrvin, Chreston Miller, Dina Smith-Glaviana, and Wen Nie Ng. 2024. “Automatic Expansion of Metadata Standards for Historic Costume Collections.” Journal of eScience Librarianship, Vol. 13 (1), e845. 10.7191/jeslib.845.
Integrated Digital Library System for Long Documents and their Elements
Satvik Chekuri, Prashant Chandrasekar, Bipasha Banerjee, Sung Hee Park, Nila Masrourisaadat, Aman Ahuja, William A. Ingram, and Edward A. Fox. 2023. “Integrated Digital Library System for Long Documents and their Elements.” In Proceedings of the 2023 ACM/IEEE Joint Conference on Digital Libraries (JCDL ’23), Santa Fe, New Mexico, USA, pp. 13–24. Nominated for Best Student Paper Award. 10.1109/JCDL57899.2023.00012.
A New Annotation Method and Dataset for Layout Analysis of Long Documents
Aman Ahuja, Kevin Dinh, Brian Dinh, William A. Ingram, and Edward Fox. 2023. “A New Annotation Method and Dataset for Layout Analysis of Long Documents.” In Companion Proceedings of the ACM Web Conference 2023 (WWW ’23 Companion), Austin, TX, USA, pp. 834–842. As part of 3rd International Workshop on Scientific Knowledge Representation, Discovery, and Assessment (Sci-K 2023). 10.1145/3543873.3587609.
DevOps Practices in Digital Library Development
Yinlin Chen. 2022. “DevOps Practices in Digital Library Development.” Proceedings of the 22nd ACM/IEEE Joint Conference on Digital Libraries (JCDL ’22), Cologne, Germany, Article No. 38, pp. 1–4. 10.1145/3529372.3533284.