Prototype for an integrated content-based language learning environment.
What it looks like right now, with working multilingual sentence segmentation and tokenization: (screenshot omitted)

The stack:
- SurrealDB + Axum + Disk as backend service exposing an API
- Python + Stanza via PyO3 for NLP (see the sketch after this list)
- Svelte frontend that interacts with the API
- Tauri as a desktop client
- fsrs-rs for SRS algorithm
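A minimal sketch of the PyO3 bridge for sentence segmentation, assuming `pyo3` with the `auto-initialize` feature and Stanza installed in the system Python; the function name and pipeline options are illustrative, not the project's actual code:

```rust
use pyo3::prelude::*;
use pyo3::types::IntoPyDict;

// Segment `text` into sentences with a tokenize-only Stanza pipeline.
// Assumes the model was fetched beforehand, e.g. stanza.download("fr").
fn segment(text: &str) -> PyResult<Vec<String>> {
    Python::with_gil(|py| {
        let stanza = py.import("stanza")?;
        // Equivalent to: stanza.Pipeline(lang="fr", processors="tokenize")
        let kwargs = [("lang", "fr"), ("processors", "tokenize")].into_py_dict(py);
        let pipeline = stanza.getattr("Pipeline")?.call((), Some(kwargs))?;

        // Run the pipeline and collect the text of each detected sentence
        let doc = pipeline.call1((text,))?;
        let mut sentences = Vec::new();
        for sent in doc.getattr("sentences")?.iter()? {
            sentences.push(sent?.getattr("text")?.extract::<String>()?);
        }
        Ok(sentences)
    })
}

fn main() -> PyResult<()> {
    for s in segment("Bonjour le monde. Ceci est une phrase.")? {
        println!("{s}");
    }
    Ok(())
}
```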
Roadmap (only a partial plan):
Phase I - Project Skeleton
- file system content access
- working vocabulary database
- allow Python scripting for extensible language support
- text processing: tokenization, lemmatization, and sentence segmentation
- document query api
- basic text reader
- token data write requests and confirmations
- svelte routing structure
- read toml language configurations
- read toml application configurations
- language-specific file listing
- ensure uniqueness of vocabulary database entries (see the sketch after this list)
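For the uniqueness item, one option is a unique index enforced by the database itself. A minimal sketch, assuming the surrealdb 1.x Rust client and a local dev server with root credentials; the table, column, and namespace names are illustrative:

```rust
use surrealdb::engine::remote::ws::Ws;
use surrealdb::opt::auth::Root;
use surrealdb::Surreal;

#[tokio::main]
async fn main() -> surrealdb::Result<()> {
    // Connect and select a namespace/database
    let db = Surreal::new::<Ws>("127.0.0.1:8000").await?;
    db.signin(Root { username: "root", password: "root" }).await?;
    db.use_ns("influx").use_db("influx").await?;

    // After this, inserting a duplicate (lang_identifier, orthography)
    // pair into `vocab` fails at the database level
    db.query("DEFINE INDEX vocab_unique ON TABLE vocab COLUMNS lang_identifier, orthography UNIQUE")
        .await?
        .check()?;
    Ok(())
}
```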
Phase II - Packaging
- tauri wrapper
- figure out how to package python dependencies (check https://pyo3.rs/v0.14.2/building_and_distribution.html)
- document the setup process
- build CI
Phase III - Frontend Usability
- feedback messages
Phase IV - Frontend Language Learning Features
- dictionary
- translation
- TTS
- sentence structure analysis?
Phase V - Code Quality
- error handling
- documentation
- security and accounts?
Phase ? - Future
- markdown rendering?
- video support
- audio support
- pdf + ocr support?
- Use `toml = "0.8.8"` for TOML settings parsing and editing (see the sketch after this list).
- The current implementation is geared toward rapid development; replace all `unwrap` calls with proper error handling.
- Files on disk could lead to race conditions, but these probably won't occur in a single-user situation.
- Language settings could be on disk
- Security? Accounts? Not a concern for now, as everything runs on localhost.
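A minimal sketch of TOML config loading with proper error propagation instead of `unwrap`, assuming `toml = "0.8.8"` plus serde; the config fields and file path are hypothetical:

```rust
use serde::Deserialize;
use std::{error::Error, fs, path::Path};

// Hypothetical shape of a per-language config; the real schema may differ.
#[derive(Debug, Deserialize)]
struct LanguageConfig {
    identifier: String,
    display_name: String,
}

fn load_language_config(path: &Path) -> Result<LanguageConfig, Box<dyn Error>> {
    // Propagate I/O and parse errors with `?` rather than panicking
    let raw = fs::read_to_string(path)?;
    let config = toml::from_str(&raw)?;
    Ok(config)
}

fn main() -> Result<(), Box<dyn Error>> {
    let config = load_language_config(Path::new("settings/fr.toml"))?;
    println!("{config:?}");
    Ok(())
}
```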
Python environment notes:
- Try not to use conda; it didn't work.
- Try not to use macOS's built-in Python; it didn't work.
- Installing Stanza in a virtual environment doesn't work for some reason; it has to be installed into the system Python.

What worked on macOS:
```sh
brew install python@3.10
brew install pipenv
python3.10 -m pip install stanza
pipenv install
pipenv shell
# If pip refuses to install into the Homebrew Python because it is
# "externally managed" (PEP 668), remove the marker file:
rm /opt/homebrew/Cellar/python\@3*/**/EXTERNALLY-MANAGED
```
To run the development builds:

```sh
# Run the backend service
cd influx_api
cargo run
```

```sh
# In another terminal, run the frontend dev server
cd influx_ui
npm run dev
```
API routes (methods default to GET if unspecified):
- `GET /` returns something random
- `GET /settings` returns app settings as JSON
- `GET /langs` returns the list of languages in the settings
- `/vocab` to work with vocabulary
    - `GET /vocab/token/{lang_identifier}/{orthography}` to query for a single token
    - `POST /vocab/create_token` to create a token
    - `POST /vocab/update_token` to update a token
    - `DELETE /vocab/delete_token` to delete a token
- `/docs` to work with documents
    - `GET /docs/{lang_identifier}` returns the list of content, with metadata, for the language specified by `lang_identifier`. Currently only markdown content is supported.
    - `GET /docs/{lang_identifier}/{filename}` returns a specific piece of content, with metadata, text, tokenised text, and the results of querying the vocabulary database
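For illustration, a minimal sketch of how the single-token route might be wired up, assuming axum 0.7; the handler, response shape, and port are illustrative rather than the project's actual code:

```rust
use axum::{extract::Path, routing::get, Json, Router};
use serde_json::{json, Value};

// Hypothetical handler: echo the path parameters instead of hitting the DB
async fn get_token(Path((lang_identifier, orthography)): Path<(String, String)>) -> Json<Value> {
    Json(json!({ "lang": lang_identifier, "orthography": orthography }))
}

#[tokio::main]
async fn main() -> std::io::Result<()> {
    let app = Router::new().route("/vocab/token/:lang_identifier/:orthography", get(get_token));
    let listener = tokio::net::TcpListener::bind("127.0.0.1:3000").await?;
    axum::serve(listener, app).await
}
```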