nl2logql/LogQLLM

Chatting with Logs: An exploratory study on Finetuning LLMs for LogQL

This repository contains the dataset-creation scripts and the code for the paper "Chatting with Logs: An exploratory study on Finetuning LLMs for LogQL".

The dataset and the fine-tuned models are available on HuggingFace under the following links:

The demo for the fine-tuned models is available at this link

System Logs

logs/: This directory contains the system logs and their respective transformation and ingestion scripts for each of the three systems: OpenSSH, OpenStack, and HDFS.

Respective logs are stored in the following directories:

Order of running the scripts:

  1. python filter.py: Filters the log headers and messages.
  2. python generate_labels.py: Generates the labels and structured metadata and writes them to parsed_<app>_logs.json.
  3. python update_timestamps.py: Updates the timestamps in the parsed logs based on datetime.now(), keeping the relative time differences between entries. This is done because Grafana Loki does not support ingesting out-of-order logs.
  4. python upload_to_loki.py: Uploads the logs to Grafana Loki.
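
Steps 3 and 4 can be illustrated with a minimal sketch. This is not the code from update_timestamps.py or upload_to_loki.py: the function names `shift_timestamps` and `to_loki_payload`, the label `application`, and the sample log lines are all hypothetical, and the sketch assumes timezone-aware datetimes. It shifts a batch of timestamps so the newest entry lands on the current time while keeping the gaps between entries, then builds a JSON body in the shape accepted by Loki's HTTP push endpoint (`/loki/api/v1/push`), with entries sorted by time and timestamps encoded as nanosecond strings.

```python
from datetime import datetime, timedelta, timezone


def shift_timestamps(timestamps, now=None):
    """Shift timestamps so the newest equals `now`, keeping relative gaps.

    Assumes timezone-aware datetimes (an assumption of this sketch).
    """
    if not timestamps:
        return []
    now = now or datetime.now(timezone.utc)
    offset = now - max(timestamps)
    return [ts + offset for ts in timestamps]


def to_loki_payload(labels, entries):
    """Build a JSON-serializable body for Loki's push endpoint.

    `entries` is a list of (datetime, log line) pairs. Each value is a
    [timestamp, line] pair, with the timestamp given as a string of
    nanoseconds since the Unix epoch. Entries are sorted by time.
    """
    values = [
        [str(int(ts.timestamp()) * 1_000_000_000 + ts.microsecond * 1_000), line]
        for ts, line in sorted(entries, key=lambda entry: entry[0])
    ]
    return {"streams": [{"stream": labels, "values": values}]}


if __name__ == "__main__":
    # Hypothetical sample data: two log lines recorded 30 seconds apart.
    start = datetime(2017, 5, 16, tzinfo=timezone.utc)
    old_times = [start, start + timedelta(seconds=30)]
    lines = ["first sample line", "second sample line"]

    new_times = shift_timestamps(old_times)
    payload = to_loki_payload({"application": "openssh"}, list(zip(new_times, lines)))
    print(payload)
```

The resulting dictionary could then be sent as JSON in a POST request to the push endpoint of a running Loki instance; the host and port depend on the deployment.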

Dataset Curation

dataset-curation/: This directory contains the scripts for creating the natural language to LogQL dataset. The dataset is created by pairing the natural language queries with the LogQL queries.
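
As a rough illustration of what one paired record might look like, the sketch below stores a natural language question next to a LogQL query and serializes the pairs as JSON Lines. The field names (`question`, `logql_query`, `application`), the label `application="openssh"`, and the example query are assumptions made for illustration, not the schema used by the scripts in this directory.

```python
import json
from dataclasses import asdict, dataclass


@dataclass
class QueryPair:
    """One natural language question and its LogQL counterpart (hypothetical schema)."""
    question: str
    logql_query: str
    application: str


def to_jsonl(pairs):
    """Serialize pairs as JSON Lines, one record per line."""
    return "\n".join(json.dumps(asdict(pair)) for pair in pairs)


if __name__ == "__main__":
    example = QueryPair(
        question="How many failed password attempts occurred in the last hour?",
        logql_query='sum(count_over_time({application="openssh"} |= "Failed password" [1h]))',
        application="openssh",
    )
    print(to_jsonl([example]))
```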

Fine-Tuning

finetuning/: This directory contains the scripts for fine-tuning the LLMs on the dataset.
