8000 Proposal: add parser adapter using myhtmlex · Issue #171 · philss/floki · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

Proposal: add parser adapter using myhtmlex #171

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed
f34nk opened this issue Feb 22, 2018 · 2 comments
Closed

Proposal: add parser adapter using myhtmlex #171

f34nk opened this issue Feb 22, 2018 · 2 comments

Comments

@f34nk
Copy link
f34nk commented Feb 22, 2018

Myhtmlex
A module to decode html into a tree structure.
Based on Alexander Borisov’s myhtml, this binding gains the properties of being html-spec compliant and very fast.

see @Overbryd comment

I also wrote https://github.com/Overbryd/nodex that can be used to provide a safe execution (c-)node to give the best in performance/safety.
I would refrain from using myhtmlex widely as a NIF without explicitly checking the crash-safety requirements of the application requiring it. So maybe providing two modes of operation (NIF and C-Node) might be the best way to go for a widely used package.

see myhtmlex branch: c-node

@alex88
Copy link
alex88 commented Apr 16, 2019

Is there an update on this or any way to speed up the use of find and parsing? Trying to profile our app we're seeing that mochiweb_html functions are those taking most of the time

Update: I've just tried to use html5ever but it seems even slower than mochiweb 😕

@philss
Copy link
Owner
philss commented Jan 1, 2020

@f34nk @alex88 thanks to @rinpatch (#240), we now can use myhtmlex through fast_html.

Please try the version from master/GitHub. I'm going to release this with the version 0.24.

@philss philss closed this as completed Jan 1, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants
0