TY - CONF TI - The Lean Construction Visual Taxonomy (LCVT): bridging the semantic gap C1 - Singapore, Singapore C3 - Proceedings of the 34th Annual Conference of the International Group for Lean Construction (IGLC 34) SP - 14 EP - 25 PY - 2026 DO - 10.24928/2026/0151 AU - Sabek, Mohamed AU - Mei, Qipei AU - Lee, Gaang AU - Golabchi, Ali AU - Gonzalez, Vicente AD - PhD Candidate, Department of Civil and Environmental Engineering, University of Alberta, Edmonton, Canada, sabek@ualberta.ca, orcid.org/0009-0005-2906-9874 AD - Assistant Professor, Department of Civil and Environmental Engineering, University of Alberta, Edmonton, Canada, qipei@ualberta.ca, AD - Assistant Professor, Department of Civil and Environmental Engineering, University of Alberta, Edmonton, Canada, gaang@ualberta.ca, AD - Adjunct Professor, Department of Civil and Environmental Engineering, University of Alberta, Edmonton, Canada, alireza1@ualberta.ca, AD - Professor, Department of Civil and Environmental Engineering, University of Alberta, Edmonton, Canada, vagonzal@ualberta.ca, orcid.org/0000-0003-3408-3863 ED - Hamzeh, Farook ED - Poshdar, Mani ED - Garcia-Lopez,, Nelly P. AB - The architecture, engineering, and construction (AEC) industry faces productivity stagnation due to ineffective production flow management. Although Lean Construction (LC) aims to minimize waste, manual monitoring lacks the high-frequency data required for timely control. Computer Vision (CV) offers automated monitoring but suffers from a "Semantic Gap," where models detect low-level objects but fail to interpret high-level Lean states (e.g., "waiting"). This study proposes the Lean Construction Visual Taxonomy (LCVT), a three-level hierarchical framework–Category, Indicator, Visual Definition grounded in Transformation-Flow-Value (TFV) theory. Crucially, the LCVT provides standardized class definitions to guide "zero-shot" prompt engineering in Vision-Language Models (VLMs). By injecting formal L3 definitions that address entity types, temporal thresholds (e.g., stationary >60 s), and spatial context into VLM models such as GPT-4o and Gemini 2.5, the framework enables sophisticated, lean reasoning without the need for massive custom-labeled datasets. Pilot validation achieved a 0.946 mAP in distinguishing state-dependent equipment loads. By formalizing the visual signatures of waste, the LCVT establishes the data infrastructure necessary for proactive, VLM-driven decision support in construction AI. KW - AI KW - transformation-flow-value KW - computer vision KW - taxonomy KW - visual management. PB - T2 - Proceedings of the 34th Annual Conference of the International Group for Lean Construction (IGLC 34) DA - 2026/06/22 CY - Singapore, Singapore L1 - http://iglc.net/Papers/Details/2468/pdf L2 - http://iglc.net/Papers/Details/2468 N1 - Export Date: 19 June 2026 DB - IGLC.net DP - IGLC LA - English ER -