DOI: 10.1145/3586183.3606791
research-article
Open access

Living Papers: A Language Toolkit for Augmented Scholarly Communication

Published: 29 October 2023

Abstract

Computing technology has deeply shaped how academic articles are written and produced, yet article formats and affordances have changed little over centuries. The status quo consists of digital files optimized for printed paper—ill-suited to interactive reading aids, accessibility, dynamic figures, or easy information extraction and reuse. Guided by formative discussions with scholarly communication researchers and publishing tool developers, we present Living Papers, a language toolkit for producing augmented academic articles that span print, interactive, and computational media. Living Papers articles may include formatted text, references, executable code, and interactive components. Articles are parsed into a standardized document format from which a variety of outputs are generated, including static PDFs, dynamic web pages, and extraction APIs for paper content and metadata. We describe Living Papers’ architecture, document model, and reactive runtime, and detail key aspects such as citation processing and conversion of interactive components to static content. We demonstrate the use and extension of Living Papers through examples spanning traditional research papers, explorable explanations, information extraction, and reading aids such as enhanced citations, cross-references, and equations. Living Papers is available as an extensible, open source platform intended to support both article authors and researchers of augmented reading and writing experiences.

Cited By

  • (2024) Dynamic Abstractions: Building the Next Generation of Cognitive Tools and Interfaces. Adjunct Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology, 1-3. DOI: 10.1145/3672539.3686706. Online publication date: 13-Oct-2024
  • (2024) A Deixis-Centered Approach for Documenting Remote Synchronous Communication Around Data Visualizations. IEEE Transactions on Visualization and Computer Graphics 31:1, 930-940. DOI: 10.1109/TVCG.2024.3456351. Online publication date: 10-Sep-2024
  • (2024) MaugVLink: Augmenting Mathematical Formulas with Visual Links. 2024 IEEE 17th Pacific Visualization Conference (PacificVis), 337-342. DOI: 10.1109/PacificVis60374.2024.00048. Online publication date: 23-Apr-2024

    Published In

    UIST '23: Proceedings of the 36th Annual ACM Symposium on User Interface Software and Technology
    October 2023
    1825 pages
    ISBN: 9798400701320
    DOI: 10.1145/3586183
    This work is licensed under a Creative Commons Attribution International 4.0 License.

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Badges

    • Honorable Mention

    Author Tags

    1. Academic Publishing
    2. Augmented Reading
    3. Interactive Articles

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    UIST '23

    Acceptance Rates

    Overall Acceptance Rate 561 of 2,567 submissions, 22%

    Article Metrics

    • Downloads (last 12 months): 1,203
    • Downloads (last 6 weeks): 146
    Reflects downloads up to 13 Jan 2025
