
Forventes utgitt: 19.05.2026
Leveringstid: 7-30 dager
Handlinger
Beskrivelse
Omtale
Test-driven data analysis is the synthesis of ideas from test-driven development of software to data-intensive work including data science, data analysis, and data engineering. It is a methodology for improving the quality of data and of analytical pipelines and processes. It can be thought of as data analysis as if the answers actually matter. Test-driven data analysis can be thought of as a sibling to reproducible research, with similar concerns, but greater emphasis on automated testing, and less requirement for a human to reproduce results. Extensive checklists are provided that can be used to improve quality before,during, and after analysis. Key Features:Prevents costly errors in analytical processes before they reach production through automated data validation and reference testing of data pipelines. • Provides actionable checklists for issues beyond the reach of automated testing. • Equips readers with open-source Python tools and language-agnostic command-line interfaces. • Addresses testing challenges for modern LLM-based systems including chat-bots and coding assistants. • Instills in analysts an inner voice that is always asking: “How is this misleading data misleading me?”
Detaljer
-
ISBN:
9781032897158
-
Språk:
, Engelsk
-
Forlag:
Chapman & Hall/CRC -
Fagtema:
-
Serie:
Chapman & Hall/CRC Data Science Series
-
Litteraturtype:
-
Sider:
424
-
Høyde:
23.4 cm
-
Bredde:
15.6 cm