09.02.2026 | Neues Paper in MAKE Special Issue



Ein Paper wurde in der Special Issue "Using Large Language Models for Scientific Problem Solving and Engineering Design" in der Zeitschrift Machine Learning and Knowledge Extraction (MAKE) veröffentlicht.

Lars Wilhelmi, Christian Bruns und Matthias Schumann publizierten den Beitrag:
Enhancing the Extraction of GHG Emission-Reduction Targets from Sustainability Reports Using Vision Language Models



Abstract: This study investigates how Vision Language Models (VLMs) can be used and methodically configured to extract Environmental, Social, and Governance (ESG) metrics from corporate sustainability reports, addressing the limitations of existing text-only and manual ESG data-extraction approaches. Using the Design Science Research Methodology, we developed an extraction artifact comprising a curated page-level dataset containing greenhouse gas (GHG) emission-reduction targets, an automated evaluation pipeline, model and text-preprocessing comparisons, and iterative prompt and few-shot refinement. Pages from oil and gas sustainability reports were processed directly by VLMs to preserve visual–textual structure, enabling a controlled comparison of text, image, and combined input modalities, with extraction quality assessed at page and attribute level using F1-scores. Among tested models, Mistral Small 3.2 demonstrated the most stable performance and was used to evaluate image, text, and combined modalities. Combined text + image modality performed best (F1 = 0.82), particularly on complex page layouts. The findings demonstrate how to effectively integrate visual and textual cues for ESG metric extraction with VLMs, though challenges remain for visually dense layouts and avoiding inference-based hallucinations.