Enhancing Retrieval for ESGLLM via ESG-CID–A Disclosure Content Index Finetuning Dataset for Mapping GRI and ESRS

Published in arXiv preprint arXiv:2503.10674, 2025

Environmental, Social, and Governance (ESG) reporting requires mapping disclosures across frameworks such as GRI (Global Reporting Initiative) and ESRS (European Sustainability Reporting Standards). This paper introduces ESG-CID, a Disclosure Content Index finetuning dataset designed to improve retrieval for ESG-focused language models. The dataset supports mapping between sustainability reporting frameworks, facilitating more accurate and efficient ESG data analysis.

Recommended citation:

@article{ahmed2025enhancing,
  title={Enhancing Retrieval for ESGLLM via ESG-CID--A Disclosure Content Index Finetuning Dataset for Mapping GRI and ESRS},
  author={Ahmed, Shafiuddin Rehan and Shah, Ankit Parag and Tran, Quan Hung and Khetan, Vivek and Kang, Sukryool and Mehta, Ankit and Bao, Yujia and Wei, Wei},
  journal={arXiv preprint arXiv:2503.10674},
  year={2025}
}