An Automated Repository for the Efficient Management of Complex Documentation
<p>Automated repository implementation structure.</p> "> Figure 2
<p>Example of a call to GPT-4o API.</p> "> Figure 3
<p>Update page.</p> "> Figure 4
<p>Messages sent to GPT models.</p> "> Figure 5
<p>Simplification of document collection and classification processes.</p> "> Figure 6
<p>Example of a document stored in MongoDB’s documents collection.</p> "> Figure 7
<p>Flowchart for the process of adding documents to the automated repository.</p> "> Figure 8
<p>Repository’s main page.</p> "> Figure 9
<p>Regenerate page.</p> "> Figure 10
<p>Regenerate document POST request.</p> "> Figure 11
<p>Relation graph.</p> "> Figure 12
<p>Relations graph detail.</p> "> Figure 13
<p>Number of documents grouped by area.</p> "> Figure 14
<p>Documents issued over time and grouped by origin.</p> "> Figure 15
<p>Cumulative count of documents present in the repository.</p> "> Figure 16
<p>Number of documents by type.</p> "> Figure 17
<p>Number of documents issued by year and by area.</p> "> Figure 18
<p>New Documents page.</p> ">
Abstract
:1. Introduction
- LLM Models indicates whether the referenced works implemented the most recent models available (contemporary to GPT-3.5 Turbo and later).
- Document Classification specifies whether the authors developed document classification functionalities in their work.
- Provides Insights indicates whether the proposed NLP-based system extracts relevant information from the documents.
- LLM-Based Repository applies to works where a documentation repository was built using LLM technologies.
- Extra Features applies to works that included additional functionalities, such as user interfaces, statistical analysis, document filters, and other enhancements integrated with LLM applications.
Proposal | Latest LLM Models | Document Classification | Provides Insights | LLM Based Repository | Extra Features |
---|---|---|---|---|---|
Ghumade et al. [12] | x | x | |||
Cascella et al. [13] | x | x | x | ||
Merchant et al. [14] | x | x | |||
Feyisa et al. [15] | x | x | x | x | |
Saka et al. [16] | x | ||||
Savelka et al. [17] | x | x | |||
Liu et al. [18] | x | x | |||
Aladag et al. [19] | x | x | |||
Thippeswamy et al. [20] | x | x | x | ||
Vallabhaneni et al. [21] | x | x | |||
Lin et al. [22] | x | x | x | ||
Feng et al. [23] | x | x | |||
Ibrahim et al. [24] | x | x | x | x | |
Litaina et al. [25] | x | x | |||
Bouzid et al. [26] | x | x | |||
Mao et al. [27] | x | ||||
Merilehto et al. [28] | x | x | x | ||
Wiest et al. [29] | x | x | x | ||
Our repository | x | x | x | x | x |
- 1.
- An automated repository to efficiently manage large volumes of documents, which aggregates a wide set of cybersecurity-related documents from various sources in Portugal and the EU;
- 2.
- A set of user-friendly search and visualization tools to provide navigation and accessibility to documents;
- 3.
- A Git-Hub repository with the whole application and the tests developed so far, available at https://github.com/JoseMiguelFrade/Automated-Repository/tree/main (accessed on 1 March 2025).
2. Background
2.1. Relevant Documents in Cybersecurity
2.2. Issuer Organizations
- Legislation Issuers
- ‑
- Assembly of the Portuguese Republic [30]: The legislative body responsible for creating and approving laws in Portugal, including those related to cybersecurity.
- ‑
- Portuguese Government [31]: The executive branch that implements and enforces laws, including policies and regulations concerning cybersecurity.
- ‑
- European Council [32]: An EU institution responsible for setting the general governance of the European Union, including broad cybersecurity strategies.
- ‑
- European Commission [33]: The executive branch of the EU that is responsible for proposing legislation, implementing decisions, and managing the day-to-day business of the EU, including cybersecurity regulations and directives.
- ‑
- European Parliament [34]: The directly elected legislative body of the EU that works with the European Council to adopt and amend proposed legislation.
- Technical Norms and Framework Issuers
- ‑
- The Portuguese Cybersecurity Center (Centro Nacional de Cibersegurança—CNCS) serves as Portugal’s national authority for the promotion of cybersecurity and the strengthening of the country’s cyber resilience. Its core functions include monitoring and responding to cyberthreats, raising public awareness, offering expert guidance, collaborating with international partners, and supporting regulatory compliance [35].
- ‑
- The European Union Agency for Cybersecurity (ENISA) is the cybersecurity authority for the EU, committed to enhancing and coordinating cybersecurity efforts across member states [36].
- ‑
- The International Standards Organization (ISO) plays a significant role in cybersecurity by developing and promoting international standards that ensure the security and resilience of information systems and networks [37]. The most relevant standards related to cybersecurity and information security belong to the ISO-27000 standards family [38].
- ‑
- The Payment Card Industry Security Standards Council (PCI SSC) is a global framework designed to ensure the security of credit and debit card transactions and protect cardholder data. The PCI DSS provides a comprehensive set of requirements for the enhancement of payment card security [39].
2.3. Legal and Technical Documents
2.3.1. Laws and Decree Laws
2.3.2. Regulations
2.3.3. Directives
2.4. Technical Standards and Frameworks
2.5. Repositories
- StandICT [44]: This repository belongs to the European Standardization Observatory and contains many IT-related standards organized by different sectors.
- Cyber Policy Portal [45]: This is a United Nations (UN) portal that offers a comprehensive platform for access to a wide range of documents related to the regulation of cyberspace. The portal offers valuable insights into the institutions responsible for managing cyberspace in each country and includes detailed information on international agreements concerning cooperation in cyberspace.
- ENISA Repositories: ENISA offers repositories containing technical documents issued by EU member countries or by ENISA itself. Two repositories stand out: ENISA Publications [46], where ENISA publishes technical documents covering Europe’s cyberspace, regulations, emerging threats, and studies related to these topics; and ENISA’s National Cybersecurity Strategies map [47], which allows users to access documents pertaining to the cybersecurity strategies of all European countries.
- Octopus Cybercrime Community [48]: Managed by the Council of Europe, this portal hosts an extensive repository of cybercrime and handles digital evidence information, namely measures, policies, and legislation adopted by several countries world wide.
- OneTrust DataGuidance [49]: DataGuidance is a platform that provides an extensive range of information on cyberlaw, specifically focusing on data privacy legislation. It includes news, articles, and discussions conducted by experts in the field, addressing the major institutions, policies, and legislation that govern cyberspace and regulate data privacy.
- Eur-Lex [50]: The EUR-Lex repository is an online resource that provides access to EU regulations, directives, decisions, legislation, international agreements, and preparatory acts. The platform is designed to serve a wide range of users, including legal professionals, researchers, and other practitioners, by providing advanced search tools that facilitate the retrieval of documents.
- CNCS Observatory [51]: The Portuguese Cybersecurity Center periodically publishes reports and insights on the state of cybersecurity in Portugal. These reports cover a wide range of topics, including the general cybersecurity landscape in Portugal, the state of cybersecurity in specific sectors, and the economic impact of cybersecurity, as well as emerging threats and cyber conflicts. The CNCS also publishes technical norms aimed at strengthening the cybersecurity capabilities of national organizations. A prime example of this effort is the “Quadro Nacional de Referência para a Cibersegurança” [52] (National Cybersecurity Reference Framework). This framework serves as a comprehensive guide for organizations to enhance their cybersecurity measures and resilience.
- Diário Da República [53]: Diário da República’s portal is the official publication platform of the Portuguese Republic. The portal provides online access to all types of legal documents, including laws, decrees, resolutions, and regulatory norms issued by various branches of the government. The portal is divided into two series: one contains laws, decrees, and other acts of general legislative nature, while the other includes a wide range of other official documents, such as notices, declarations, and contracts.
2.6. The Necessity of Automated Tools for Document Analysis
3. Overall Repository Architecture
3.1. Repository Structure
- ChatGPT API: The utilization of the GPT Application Programming Interface (API) [55] (Section 3.2) allows the utilization of a large language model (LLM)-type AI model for the automation of various tasks related to document processing (Section 3.6).
- PDFCrawler: The PDF crawler is specialized in obtaining documents in PDF format from specified Uniform Resource Locators (URL) and is the focus of Section 3.4.
- Really Simple Syndication (RSS) Feed Consumer: This backend module is responsible for querying external repositories in search of news documents. The implementation of this functionality is described in Section 3.5.
3.2. GPT API
- Model: Specifies which GPT model processes the request. Available models include “gpt-4o”, “gpt-4-0125-preview” (GPT-4 Turbo), and “gpt-3.5-turbo-1106” (GPT-3.5 Turbo). Model versions and designations are regularly updated. This parameter is mandatory.
- Messages: Contains the prompts for the model. A message has two parts: “role” (context for the model) and “content” (the actual prompt). The first message should be in the System context (“role”: “system”) to guide the model to act like an assistant. Subsequent messages typically have the User context. This parameter is mandatory.
- Temperature: A value from 0 to 2 that influences the model’s responses. Lower values make the model more factual and precise, while higher values make it more creative. This parameter is optional.
- Top_p: Similar to “Temperature”, it controls the model’s behavior by adjusting the probability distribution of tokens. It is recommended to adjust “Temperature” or “Top_p”, but not both. The default value is 1.0. This parameter is optional.
- Max_tokens: Specifies the maximum number of tokens that the model can output. Tokens are basic units of text (words, parts of words, punctuation). This parameter helps to manage costs and, while not mandatory, is highly recommended.
3.3. MongoDB
3.4. PDFCrawler
3.5. RSS Feed Consumer
3.6. Automated Collection and Classification
- Title: Official document designation (e.g., Regulation (EU) 2016/679).
- Date: Date of first publication.
- Origin: Document origin (country, institution, etc.).
- Issuer: Entity that published the document (e.g., European Parliament).
- Subject: Document’s focus (e.g., security, data privacy, governance).
- Area: Sector targeted by the document (e.g., healthcare, justice).
- Related Documents: Directly related documents, if any.
- Abstract: Brief summary (maximum of 95 tokens) of the document.
3.7. Implementation Scenarios
4. Result Analysis
4.1. Regenerate Document Fields
- documentID: This is the ID that represents the document whose field is going to be regenerated.
- field: This is a value that represents the field that will be regenerated. This information will be used to select the correct prompt that will be sent to GPT to perform this operation.
- temperature: This is the creativity level, also known as the temperature, as explained in Section 3.2, at which GPT operates while generating its response. This level ranges from 0 (more technical) to 2 (more creative).
4.2. Relations Graph
4.3. Statistics
- Number of documents grouped by area, from all or different issuers—Figure 13;
- Number of documents issued over time and grouped by different origins—Figure 14;
- Cumulative and monthly count of documents present in the repository—Figure 15;
- Number of documents grouped by type (law, decree law, report, etc.)—Figure 16;
- Number of documents issued per year and per area—Figure 17.
4.4. New Documents Page
5. Discussion
- Using the tools available in the repositories to locate pages containing potential documents of interest and then submitting the page links to PDFCrawler;
- Consulting the “New Documents Page”, identifying potentially interesting documents, and then sending them to PDFCrawler using the user interface button of the same page.
- Title: As expected, most titles generated by GPT matched the official document titles. However, some titles were excessively long, and many were in all capital letters, reducing their visual appeal. Prompt refinement can address this issue.
- Subject: GPT accurately identified document subjects with minimal issues. Occasionally, it confused terms meant for the categorization of areas with those meant for subjects, leading to some inaccuracies. Again, some prompt refinements could easily prevent most of these occurrences.
- Areas: GPT performed well in straightforward cases but often labeled areas as “general” when interpretation or context was required.
- Related Documents: This attribute proved challenging, as GPT only identified documents as related if they were mentioned in the first 850 tokens provided for analysis, resulting in many documents lacking references in this category.
- Abstract: GPT excelled in generating concise yet informative abstracts that captured the documents’ key insights. Interestingly, some abstracts mentioned related documents not identified in the “Related Documents” attribute.
- Other Attributes: GPT had no significant issues generating the “issuer”, “origin”, “type”, and “date” attributes.
- AI-driven document selection: GPT saved tremendous effort in detecting documents that were not of interest in the area of study. Although many repositories allow for advanced query generation, these queries require user learning and expertise. Moreover, keyword-based queries often return documents that reference the keyword but are beyond the user’s context of interest. GPT selection proved to be a useful complementary filter, saving much effort in selecting documents of interest.
- Extracting basic document information: The titles, dates, and issuers of hundreds of documents were extracted with minimal effort and almost no human intervention.
- Dividing and organizing documents by area and subject: With some human refinement, the results given by AI were organized effectively.
- Producing valuable abstracts: No human interaction was needed to generate these summaries, allowing for a basic understanding of the content and objectives of the individual documents. Abstracts were generated for hundreds of documents in minutes, which is a very efficient result.
- Displaying relationships between documents: A network graph was created to show these relationships. Although some human intervention was needed, the effort was minimal, and this functionality provides valuable insights into the general landscape of the area’s documentation.
- Creating visual graphs: These graphs offer a different and statistical point of view of the current state of documentation, helping professionals to understand the temporal evolution of their areas of work and activity.
- Automated document collection: Developed with the help of PDFCrawler and adapted to our project, this tool removes much of the effort needed in gathering large volumes of documents. This can save time when searching for and downloading documents. The keyword feature, adaptable to any need, also saves effort and time by pre-selecting documents based on the provided keywords.
- Dedicated page for recent documents: This page notifies users of possible documents of interest that have been recently published, allowing professionals to stay updated and informed about the latest developments and to keep their documentation up to date.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Floridi, L.; Cowls, J. A unified framework of five principles for AI in society. In Machine Learning and the City: Applications in Architecture and Urban Design; Wiley: Hoboken, NJ, USA, 2022; pp. 535–545. [Google Scholar]
- Chng, S.; Lu, H.Y.; Kumar, A.; Yau, D. Hacker types, motivations and strategies: A comprehensive framework. Comput. Hum. Behav. Rep. 2022, 5, 100167. [Google Scholar] [CrossRef]
- Wang, C.; Zhang, N.; Wang, C. Managing privacy in the digital economy. Fundam. Res. 2021, 1, 543–551. [Google Scholar] [CrossRef]
- Quach, S.; Thaichon, P.; Martin, K.D.; Weaven, S.; Palmatier, R.W. Digital technologies: Tensions in privacy and data. J. Acad. Mark. Sci. 2022, 50, 1299–1323. [Google Scholar] [CrossRef] [PubMed]
- Lindgren, I.; Madsen, C.Ø; Hofmann, S.; Melin, U. Close encounters of the digital kind: A research agenda for the digitalization of public services. Gov. Inf. Q. 2019, 36, 427–436. [Google Scholar] [CrossRef]
- Usman, M.; Felderer, M.; Unterkalmsteiner, M.; Klotins, E.; Mendez, D.; Alégroth, E. Compliance requirements in large-scale software development: An industrial case study. In Proceedings of the Product-Focused Software Process Improvement: 21st International Conference, PROFES 2020, Turin, Italy, 25–27 November 2020; Proceedings 21. Springer: Berlin/Heidelberg, Germany, 2020; pp. 385–401. [Google Scholar]
- Kutyłowski, M.; Lauks-Dutka, A.; Yung, M. Gdpr–challenges for reconciling legal rules with technical reality. In Proceedings of the Computer Security–ESORICS 2020: 25th European Symposium on Research in Computer Security, ESORICS 2020, Guildford, UK, 14–18 September 2020; Proceedings, Part I 25. Springer: Berlin/Heidelberg, Germany, 2020; pp. 736–755. [Google Scholar]
- Filipovska, E.; Mladenovska, A.; Bajrami, M.; Dobreva, J.; Hillman, V.; Lameski, P.; Zdravevski, E. Benchmarking OpenAI’s APIs and other Large Language Models for Repeatable and Efficient Question Answering Across Multiple Documents. In Proceedings of the 2024 19th Conference on Computer Science and Intelligence Systems (FedCSIS), Belgrade, Serbia, 8–11 September 2024; IEEE: New York, NY, USA, 2024; pp. 107–117. [Google Scholar]
- Törnberg, P. How to use Large Language Models for Text Analysis. arXiv 2023, arXiv:2307.13106. [Google Scholar]
- GPT-3.5 Turbo. 2024. Available online: https://platform.openai.com/docs/models#gpt-3-5-turbo (accessed on 1 July 2024).
- GPT-4o. 2024. Available online: https://platform.openai.com/docs/models#gpt-4o (accessed on 1 July 2024).
- Ghumade, T.G.; Deshmukh, R. A document classification using NLP and recurrent neural network. Int. J. Eng. Adv. Technol. 2019, 8, 632–636. [Google Scholar] [CrossRef]
- Cascella, M.; Semeraro, F.; Montomoli, J.; Bellini, V.; Piazza, O.; Elena, B. The Breakthrough of Large Language Models Release for Medical Applications: 1-Year Timeline and Perspectives. J. Med. Syst. 2024, 48, 22. [Google Scholar] [CrossRef] [PubMed]
- Merchant, K.; Pande, Y. NLP Based Latent Semantic Analysis for Legal Text Summarization. In Proceedings of the 2018 International Conference on Advances in Computing, Communications and Informatics (ICACCI), Bangalore, India, 19–22 September 2018; pp. 1803–1807. [Google Scholar] [CrossRef]
- Feyisa, D.W.; Berihun, H.; Zewdu, A.; Najimoghadam, M.; Zare, M. The future of document indexing: GPT and Donut revolutionize table of content processing. arXiv 2024, arXiv:2403.07553. [Google Scholar]
- Saka, A.; Taiwo, R.; Saka, N.; Salami, B.A.; Ajayi, S.; Akande, K.; Kazemi, H. GPT models in construction industry: Opportunities, limitations, and a use case validation. Dev. Built Environ. 2023, 17, 100300. [Google Scholar] [CrossRef]
- Savelka, J.; Agarwal, A.; Bogart, C.; Song, Y.; Sakr, M. Can Generative Pre-trained Transformers (GPT) Pass Assessments in Higher Education Programming Courses? In Proceedings of the 2023 Conference on Innovation and Technology in Computer Science Education V. 1. ACM, ITiCSE. Turku, Finland, 7–12 July 2023; 2023. [Google Scholar] [CrossRef]
- Liu, S.; Healey, C.G. Abstractive Summarization of Large Document Collections Using GPT. arXiv 2023, arXiv:2310.05690. [Google Scholar]
- Aladağ, F. The Potential of GPT in Ottoman Studies: Computational Analysis of Evliya Çelebi’s Travelogue with NLP and Text Mining and Digital Edition with TEI. Culture 2023, 5, 7. [Google Scholar]
- Thippeswamy, B.; Ramachandra, H.; Rohan, S.; Salam, R.; Pai, M. TextVerse: A Streamlit Web Application for Advanced Analysis of PDF and Image Files with and without Language Models. In Proceedings of the 2024 Asia Pacific Conference on Innovation in Technology (APCIT), Mysuru, India, 26–27 July 2024; IEEE: New York, NY, USA, 2024; pp. 1–6. [Google Scholar]
- Vallabhaneni, U.; Wutla, Y.; Dichpally, T.; Ch, V.R.R.; Gone, M.R.; Kumari, P.L. Mining Mate: A Chat Bot for Navigating Mining Regulations Using LLM Models. In Proceedings of the 2024 10th International Conference on Advanced Computing and Communication Systems (ICACCS), Coimbatore, India, 14–15 March 2024; IEEE: New York, NY, USA, 2024; Volume 1, pp. 888–892. [Google Scholar]
- Lin, L.H.M.; Ting, F.K.; Chang, T.J.; Wu, J.W.; Tsai, R.T.H. GPT4ESG: Streamlining Environment, Society, and Governance Analysis with Custom AI Models. In Proceedings of the 2024 IEEE 4th International Conference on Electronic Communications, Internet of Things and Big Data (ICEIB), New Taipei, Taiwan, 19–21 April 2024; IEEE: New York, NY, USA, 2024; pp. 442–446. [Google Scholar]
- Feng, Y. Semantic textual similarity analysis of clinical text in the era of llm. In Proceedings of the 2024 IEEE Conference on Artificial Intelligence (CAI), Singapore, 25–27 June 2024; IEEE: New York, NY, USA, 2024; pp. 1284–1289. [Google Scholar]
- Ibrahim, I.M.; Attallah, M.S.; Abdel Hamid, S.O.; Zween, S.T.; Abuhadrous, I. Leveraging Large Language Models for Document Analysis and Decision-Making in AI Chatbots. Adv. Sci. Technol. J. 2025, 2, 1–16. [Google Scholar] [CrossRef]
- Litaina, T.; Soularidis, A.; Bouchouras, G.; Kotis, K.; Kavakli, E. Towards llm-based semantic analysis of historical legal documents. In Proceedings of the SemDH2024: First International Workshop of Semantic Digital Humanities, co-located with ESWC2024, Hersonissos, Greece, 26–27 May 2024. [Google Scholar]
- Bouzid, S.; Piron, L. Leveraging Generative AI in Short Document Indexing. Electronics 2024, 13, 3563. [Google Scholar] [CrossRef]
- Mao, Q.; Dabrowski, A.; Wei, F.; Olson, E.; Neary, R.; Yang, J.; Qin, H.; Huber-Fliflet, N. Comparative Analysis of LLM-Generated Event Timeline Summarization for Legal Investigations. In Proceedings of the 2024 IEEE International Conference on Big Data (BigData), Washington, DC, USA, 15–18 December 2024; IEEE: New York, NY, USA, 2024; pp. 4743–4750. [Google Scholar]
- Merilehto, J. From PDFs to Structured Data: Utilizing LLM Analysis in Sports Database Management. arXiv 2024, arXiv:2410.17619. [Google Scholar]
- Wiest, I.C.; Lessmann, M.E.; Wolf, F.; Ferber, D.; Van Treeck, M.; Zhu, J.; Ebert, M.P.; Westphalen, C.B.; Wermke, M.; Kather, J.N. Anonymizing medical documents with local, privacy preserving large language models: The LLM-Anonymizer. medRxiv 2024. [Google Scholar] [CrossRef]
- Assembleia da República. 2025. Available online: https://www.parlamento.pt (accessed on 30 January 2025).
- Governo de Portugal. 2025. Available online: https://www.portugal.gov.pt/pt/gc24/primeiro-ministro,organization=RepúblicaPortuguesa (accessed on 31 January 2025).
- European Council. 2025. Available online: https://www.consilium.europa.eu/en/european-council (accessed on 1 February 2025).
- European Commission, Official Website. 2025. Available online: https://commission.europa.eu/index_en (accessed on 1 February 2025).
- European Parliament. 2025. Available online: https://www.europarl.europa.eu/portal/en (accessed on 1 February 2025).
- CNCS—Centro Nacional de Cibersegurança. 2025. Available online: https://www.cncs.gov.pt (accessed on 1 February 2025).
- ENISA. 2021. Available online: https://www.enisa.europa.eu (accessed on 1 February 2025).
- ISO—International Organization for Standardization. 2025. Available online: https://www.iso.org/home.html (accessed on 1 February 2025).
- ISO-27000; Information Technology—Security Techniques—Information Security Management Systems—Overview and Vocabulary. ISO: Geneva, Switzerland, 2018.
- Official PCI Security Standards Council Site. 2025. Available online: https://www.pcisecuritystandards.org (accessed on 1 February 2025).
- No Direito Português, Qual a Diferença Entre Uma lei, Um Decreto-lei e uma Portaria? 2025. Available online: https://ffms.pt/pt-pt/direitos-e-deveres/no-direito-portugues-qual-diferenca-entre-uma-lei-um-decreto-lei-e-uma-portaria (accessed on 1 February 2025).
- Infopédia. Regulamento—Infopédia. 2025. Available online: https://www.infopedia.pt/apoio/artigos/$regulamento (accessed on 1 February 2025).
- Tipos de Legislação | União Europeia. 2025. Available online: https://european-union.europa.eu/institutions-law-budget/law/types-legislation_pt (accessed on 1 February 2025).
- Contribuidores dos Projetos da Wikimedia. Norma técnica – Wikipédia, a Enciclopédia Livre. 2023. Available online: https://pt.wikipedia.org/w/index.php?title=Norma_t%C3%A9cnica&oldid=66145026 (accessed on 1 February 2025).
- | StandICT.eu 2026. 2025. Available online: https://standict.eu (accessed on 1 February 2025).
- Cyber Policy Portal. 2025. Available online: https://cyberpolicyportal.org (accessed on 1 February 2025).
- Publications. 2025. Available online: https://www.enisa.europa.eu/publications#c3=2014&c3=2024&c3=false&c5=publicationDate&reversed=on&b_start=0 (accessed on 1 February 2025).
- National Cyber Security Strategies—Interactive Map. 2025. Available online: https://tools.enisa.europa.eu/topics/national-cyber-security-strategies/ncss-map/national-cyber-security-strategies-interactive-map (accessed on 1 February 2025).
- Country Wiki—Octopus Cybercrime Community— www.coe.int. 2025. Available online: https://www.coe.int/en/web/octopus/country-wiki (accessed on 1 February 2025).
- DataGuidance. 2025. Available online: https://www.dataguidance.com (accessed on 1 February 2025).
- EU Law—EUR-Lex. 2025. Available online: https://eur-lex.europa.eu/homepage.html?locale=en (accessed on 1 February 2025).
- CNCS—Observatório de Cibersegurança. 2025. Available online: https://www.cncs.gov.pt/pt/observatorio (accessed on 1 February 2025).
- CNCS—Quadro Nacional. 2025. Available online: https://www.cncs.gov.pt/pt/quadro-nacional (accessed on 1 February 2025).
- Diário da República. 2025. Available online: https://diariodarepublica.pt/dr/home (accessed on 1 February 2025).
- MongoDB: The Developer Data Platform. 2025. Available online: https://www.mongodb.com (accessed on 1 February 2025).
- OpenAI API. 2025. Available online: https://openai.com/api (accessed on 1 February 2025).
- Welcome to Flask—Flask Documentation (3.0.x). 2025. Available online: https://flask.palletsprojects.com/en/stable/changes/#version-3-0-0 (accessed on 1 February 2025).
- Vuetify—A Vue Component Framework. 2025. Available online: https://vuetifyjs.com/en/#installation (accessed on 1 February 2025).
- Vis-Network. 2025. Available online: https://github.com/visjs/vis-network (accessed on 1 February 2025).
- Chart.js. 2025. Available online: https://www.chartjs.org (accessed on 21 May 2024).
- ChatGPT. 2025. Available online: https://openai.com/chatgpt (accessed on 1 February 2025).
- OpenAI. 2025. Available online: https://openai.com (accessed on 10 February 2025).
- OpenAI Models. 2025. Available online: https://platform.openai.com/docs/models (accessed on 1 February 2025).
- Pricing. 2024. Available online: https://openai.com/api/pricing/ (accessed on 1 March 2025).
- SimFin. Pdf-Crawler. 2025. Available online: https://github.com/SimFin/pdf-crawler/tree/master (accessed on 1 February 2025).
- Openai. Tiktoken. 2025. Available online: https://github.com/openai/tiktoken (accessed on 1 February 2025).
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Frade, J.; Antunes, M. An Automated Repository for the Efficient Management of Complex Documentation. Information 2025, 16, 205. https://doi.org/10.3390/info16030205
Frade J, Antunes M. An Automated Repository for the Efficient Management of Complex Documentation. Information. 2025; 16(3):205. https://doi.org/10.3390/info16030205
Chicago/Turabian StyleFrade, José, and Mário Antunes. 2025. "An Automated Repository for the Efficient Management of Complex Documentation" Information 16, no. 3: 205. https://doi.org/10.3390/info16030205
APA StyleFrade, J., & Antunes, M. (2025). An Automated Repository for the Efficient Management of Complex Documentation. Information, 16(3), 205. https://doi.org/10.3390/info16030205