NeFut Logo NeFut
Admin Login

[CS.AI] Agentic Retrieval Framework for Context-Aware Data Quality Assessment

Published at: 2026-06-15 22:00 Last updated: 2026-06-16 12:13
#algorithm #AI #Machine Learning

Data quality assessment is a critical prerequisite for effective data analytics and data-driven decision-making, yet it remains a challenging task due to the inherently context-dependent nature of data quality. Existing approaches often rely on static rules or manual assessment strategies, limiting their adaptability to diverse usage scenarios and constraining automation at scale. Recent advances in artificial intelligence, particularly large language models, offer new opportunities for automating data quality assessment, but raise concerns related to reliability, grounding, and execution safety.

In this paper, we propose a unified agentic-retrieval framework for autonomous context-aware data quality assessment. The framework interprets natural-language descriptions of intended data usage, derives context-aware assessment strategies, and generates executable validation logic through a multi-agent workflow.

To ensure operational reliability, the framework introduces a feasibility validation stage that evaluates the realism and executability of generated assessment specifications before execution, enabling iterative refinement when necessary. Accepted validation logic is executed deterministically to guarantee reproducible and auditable results.

We implement the proposed framework as an end-to-end prototype and evaluate it across multiple usage scenarios applied to the same dataset. The results demonstrate that assessment outcomes adapt meaningfully to different intended uses, while feasibility-gated execution reduces unrealistic or non-executable rule generation. The proposed approach provides a practical foundation for deploying autonomous yet controlled data quality assessment in modern data-driven environments.

Blogger's Review: The framework presented in this paper significantly enhances the intelligence of data quality assessment by integrating natural language processing with multi-agent systems. The introduction of a feasibility validation stage ensures the reliability of assessment logic, providing a solid foundation for practical applications. This innovative approach deserves to be promoted in a broader range of data analytics scenarios.

Original Source: https://arxiv.org/abs/2606.13692

[h] Back to Home