DOI: 10.1145/3531073.3534468
demonstration

Corpus Summarization and Exploration using Multi-Mosaics

Published: 06 June 2022

Abstract

In fields such as translation studies and computational linguistics, various tools are used to analyze the content of text corpora and to extract keywords and other entities for analysis. Concordancing, that is, arranging passages of a text corpus in alphabetical order of user-defined keywords, is one of the most widely used forms of text analysis. This paper describes Multi-Mosaics, a tool for text analysis using multiple implicitly linked Concordance Mosaic visualisations. Multi-Mosaics supports examining linguistic relationships within the context windows surrounding multiple extracted keywords.
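
To make the idea of concordancing concrete, the following is a minimal sketch of a keyword-in-context listing. It is not the Multi-Mosaics implementation: the function names `tokenize` and `concordance`, the simplistic regular-expression tokenizer, the `window` parameter, and the choice to sort by keyword and then by right-hand context are all assumptions made for illustration.

```python
import re
from typing import List, Tuple


def tokenize(text: str) -> List[str]:
    """Lowercase the text and split it into word tokens (a deliberately simple tokenizer)."""
    return re.findall(r"[a-z0-9']+", text.lower())


def concordance(text: str, keywords: List[str], window: int = 3) -> List[Tuple[str, str, str]]:
    """Return (left context, keyword, right context) triples for every keyword hit.

    Each context holds up to `window` tokens. The result is sorted
    alphabetically by keyword, then by the right-hand context
    (an illustrative ordering, not the only possible one).
    """
    tokens = tokenize(text)
    wanted = {k.lower() for k in keywords}
    lines = []
    for i, tok in enumerate(tokens):
        if tok in wanted:
            left = " ".join(tokens[max(0, i - window):i])
            right = " ".join(tokens[i + 1:i + 1 + window])
            lines.append((left, tok, right))
    lines.sort(key=lambda line: (line[1], line[2]))
    return lines


if __name__ == "__main__":
    sample = "The corpus is large. A corpus of text supports analysis."
    for left, keyword, right in concordance(sample, ["corpus", "text"], window=2):
        print(f"{left:>20} | {keyword} | {right}")
```

Each triple corresponds to one context window around a keyword occurrence. A tool working with several keywords at once could build one such listing per keyword and compare the words that appear at each window position.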


Published In

AVI '22: Proceedings of the 2022 International Conference on Advanced Visual Interfaces
June 2022
414 pages
ISBN: 9781450397193
DOI: 10.1145/3531073
Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. Text visualization
  2. Concordance visualisation
  3. Text analysis

Qualifiers

  • Demonstration
  • Research
  • Refereed limited

Conference

AVI 2022

Acceptance Rates

Overall Acceptance Rate 128 of 490 submissions, 26%

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Total Citations: 0
  • Total Downloads: 43
  • Downloads (Last 12 months): 7
  • Downloads (Last 6 weeks): 1

Reflects downloads up to 18 Jan 2025
