Data and information visualization

Professor Edward Tufte described Charles Joseph Minard's 1869 graphic of the French invasion of Russia as potentially "the best statistical graphic ever drawn", noting it captures 6 variables in 2 dimensions.[1]

Data and information visualization (data viz/vis or info viz/vis) is the practice of designing and creating graphic or visual representations of[2] quantitative and qualitative data and information with the help of static, dynamic or interactive visual items. These visualizations are intended to help a target audience visually explore and discover, quickly understand, interpret and gain important insights into otherwise difficult-to-identify structures, relationships, correlations, local and global patterns, trends, variations, constancy, clusters, outliers and unusual groupings within data.[3][4][5] When intended for the public to convey a concise version of information in an engaging manner,[3] it is typically called infographics.

Data visualization is concerned with presenting sets of primarily quantitative raw data in a schematic form, using imagery. The visual formats used in data visualization include charts and graphs, geospatial maps, figures, correlation matrices, percentage gauges, etc..

Information visualization deals with multiple, large-scale and complicated datasets which contain quantitative data, as well as qualitative, and primarily abstract information, and its goal is to add value to raw data, improve the viewers' comprehension, reinforce their cognition and help derive insights and make decisions as they navigate and interact with the graphical display. Visual tools used include maps for location based data; hierarchical[6] organisations of data; displays that prioritise relationships such as Sankey diagrams; flowcharts, timelines.

Emerging technologies like virtual, augmented and mixed reality have the potential to make information visualization more immersive, intuitive, interactive and easily manipulable and thus enhance the user's visual perception and cognition.[7] In data and information visualization, the goal is to graphically present and explore abstract, non-physical and non-spatial data collected from databases, information systems, file systems, documents, business data, which is different from scientific visualization, where the goal is to render realistic images based on physical and spatial scientific data to confirm or reject hypotheses.[8]

Effective data visualization is properly sourced, contextualized, simple and uncluttered. The underlying data is accurate and up-to-date to ensure insights are reliable. Graphical items are well-chosen and aesthetically appealing, with shapes, colors and other visual elements used deliberately in a meaningful and non-distracting manner. The visuals are accompanied by supporting texts. Verbal and graphical components complement each other to ensure clear, quick and memorable understanding. Effective information visualization is aware of the needs and expertise level of the target audience.[9][2] Effective visualization can be used for conveying specialized, complex, big data-driven ideas to a non-technical audience in a visually appealing, engaging and accessible manner, and domain experts and executives for making decisions, monitoring performance, generating ideas and stimulating research.[9][3] Data scientists, analysts and data mining specialists use data visualization to check data quality, find errors, unusual gaps, missing values, clean data, explore the structures and features of data, and assess outputs of data-driven models.[3] Data and information visualization can be part of data storytelling, where they are paired with a narrative structure, to contextualize the analyzed data and communicate insights gained from analyzing it to convince the audience into making a decision or taking action.[2][10] This can be contrasted with statistical graphics, where complex data are communicated graphically among researchers and analysts to help them perform exploratory data analysis or convey results of such analyses, where visual appeal, capturing attention to a certain issue and storytelling are less important.[11]

Data and information visualization is interdisciplinary, it incorporates principles found in descriptive statistics,[12] visual communication, graphic design, cognitive science and, interactive computer graphics and human-computer interaction.[13] Since effective visualization requires design skills, statistical skills and computing skills, it is both an art and a science.[14] Visual analytics marries statistical data analysis, data and information visualization and human analytical reasoning through interactive visual interfaces to help users reach conclusions, gain actionable insights and make informed decisions which are otherwise difficult for computers to do. Research into how people read and misread types of visualizations helps to determine what types and features of visualizations are most understandable and effective.[15][16] Unintentionally poor or intentionally misleading and deceptive visualizations can function as powerful tools which disseminate misinformation, manipulate public perception and divert public opinion.[17] Thus data visualization literacy has become an important component of data and information literacy in the information age akin to the roles played by textual, mathematical and visual literacy in the past.[18]

  1. ^ Corbett, John. "Charles Joseph Minard: Mapping Napoleon's March, 1861". Center for Spatially Integrated Social Science. Archived from the original on 19 June 2003. (CSISS website has moved; use archive link for article)
  2. ^ a b c Nussbaumer Knaflic, Cole (2 November 2015). Storytelling with Data: A Data Visualization Guide for Business Professionals. John Wiley & Sons. ISBN 978-1-119-00225-3.
  3. ^ a b c d Antony Unwin (31 January 2020). "Why Is Data Visualization Important? What Is Important in Data Visualization?". Harvard Data Science Review. 2 (1). doi:10.1162/99608f92.8ae4d525. Retrieved 27 March 2023.
  4. ^ Ananda Mitra (2018), "Managing and Visualizing Unstructured Big Data", Encyclopedia of Information Science and Technology (4th ed.), IGI Global
  5. ^ Bhuvanendra Putchala; Lasya Sreevidya Kanala; Devi Prasanna Donepudi; Hari Kishan Kondaveeti (2023), "Applications of Big Data Analytics in Healthcare Informatics", in Narasimha Rao Vajjhala; Philip Eappen (eds.), Health Informatics and Patient Safety in Times of Crisis, IGI Global, pp. 175–194
  6. ^ Heer, Jeffrey, Bostock, Michael, Ogievetsky, Vadim (2010) A tour through the visualization zoo, Communications of the ACM, Volume 53, Issue 6 Pages 59 - 67 https://doi.org/10.1145/1743546.1743567
  7. ^ Olshannikova, Ekaterina; Ometov, Aleksandr; Koucheryavy, Yevgeny; Ollson, Thomas (2015), "Visualizing Big Data with augmented and virtual reality: challenges and research agenda.", Journal of Big Data, 2 (22), doi:10.1186/s40537-015-0031-2
  8. ^ Card, Mackinlay, and Shneiderman (1999), Readings in Information Visualization: Using Vision to Think, Morgan Kaufmann, pp. 6–7{{citation}}: CS1 maint: multiple names: authors list (link)
  9. ^ a b "What is data visualization?". IBM. 28 September 2021. Retrieved 27 March 2023.
  10. ^ Brent Dykes (2019), Effective Data Storytelling: How to Drive Change with Data, Narrative and Visuals, John Wiley & Sons, p. 16
  11. ^ David C. LeBlanc (2004), Statistics: Concepts and Applications for Science, Jones & Bartlett Learning, pp. 35–36
  12. ^ Grandjean, Martin (2022). "Data Visualization for History". Handbook of Digital Public History: 291–300. doi:10.1515/9783110430295-024. ISBN 9783110430295.
  13. ^ E.H. Chi (2013), A Framework for Visualizing Information, Springer Science & Business Media, p. xxiii
  14. ^ Gershon, Nahum; Page, Ward (1 August 2001). "What storytelling can do for information visualization". Communications of the ACM. 44 (8): 31–37. doi:10.1145/381641.381653. S2CID 7666107.
  15. ^ Mason, Betsy (November 12, 2019). "Why scientists need to be better at data visualization". Knowable Magazine. doi:10.1146/knowable-110919-1.
  16. ^ O'Donoghue, Seán I.; Baldi, Benedetta Frida; Clark, Susan J.; Darling, Aaron E.; Hogan, James M.; Kaur, Sandeep; Maier-Hein, Lena; McCarthy, Davis J.; Moore, William J.; Stenau, Esther; Swedlow, Jason R.; Vuong, Jenny; Procter, James B. (2018-07-20). "Visualization of Biomedical Data". Annual Review of Biomedical Data Science. 1 (1): 275–304. doi:10.1146/annurev-biodatasci-080917-013424. hdl:10453/125943. S2CID 199591321. Retrieved 25 June 2021.
  17. ^ Leo Yu-Ho Lo; Ayush Gupta; Kento Shigyo; Aoyu Wu; Enrico Bertini; Huamin Qu, Misinformed by Visualization: What Do We Learn From Misinformative Visualizations?
  18. ^ Börner, K.; Bueckle, A.; Ginda, M. (2019), "Data visualization literacy: Definitions, conceptual frameworks, exercises, and assessments", Proceedings of the National Academy of Sciences, 116 (6): 1857–1864, Bibcode:2019PNAS..116.1857B, doi:10.1073/pnas.1807180116, PMC 6369751, PMID 30718386

© MMXXIII Rich X Search. We shall prevail. All rights reserved. Rich X Search