ViStruct: Simulating Expert-Like Reasoning Through Task Decomposition and Visual Attention Cues
Oliver Huang
Carolina Nobre

Room: Hall E2
Keywords
Data Visualization, Task Decomposition, Large Language Models (LLMs), Guidance System, Computer Vision
Abstract
Data visualization tasks often require multi-step reasoning, yet the interpretive strategies experts use, such as decomposing complex goals into smaller subtasks and selectively attending to key chart regions, are rarely made explicit. ViStruct is an automated pipeline that simulates these expert behaviours by breaking high-level visual questions into structured analytic steps and highlighting semantically relevant chart areas. Leveraging large language and vision-language models, ViStruct identifies chart components, maps subtasks to spatial regions, and presents visual attention cues that externalize expert-like reasoning flows. While not designed for direct novice instruction, ViStruct provides a replicable model of expert interpretation that can inform the development of future visual literacy tools. We evaluate the system on 45 tasks across 12 chart types and validate its outputs with trained visualization users, confirming that it produces interpretable, expert-aligned reasoning sequences.
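
The abstract does not detail the pipeline's implementation, but as a rough illustration of the decomposition-and-mapping flow it describes, the Python sketch below outlines one plausible structure. All identifiers (call_llm, decompose_question, map_steps_to_regions, SubTask) are hypothetical stand-ins rather than the authors' code, and the keyword-based region-matching heuristic is an assumption for demonstration only.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SubTask:
    """One analytic step, optionally tied to a chart region to highlight."""
    description: str
    region: Optional[tuple] = None  # (x, y, width, height) in image pixels


def call_llm(prompt: str) -> str:
    """Stub for a large-language-model call; wire in a real client here."""
    raise NotImplementedError


def decompose_question(question: str, chart_type: str) -> list:
    """Ask the model to break a high-level visual question into ordered steps."""
    prompt = (
        f"Decompose this question about a {chart_type} into a numbered list "
        f"of low-level analytic steps, one per line:\n{question}"
    )
    return [line.strip() for line in call_llm(prompt).splitlines() if line.strip()]


def map_steps_to_regions(steps, components):
    """Pair each step with the bounding box of the chart component it mentions.

    `components` maps component names (e.g. 'legend', 'x-axis') to boxes,
    as a vision-language model might produce from the chart image.
    """
    return [
        SubTask(
            description=step,
            region=next(
                (box for name, box in components.items() if name in step.lower()),
                None,
            ),
        )
        for step in steps
    ]
```

In the full system, the component boxes would come from the vision-language stage run over the chart image, and the matched regions would drive the visual attention cues presented alongside each reasoning step.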