Command line interface for Defuddle. Extract clean HTML or Markdown from pages.
npm install -g defuddle-cli
defuddle parse <source> [options]
source
: HTML file path or URL to parse
-o, --output <file>
: Output file path (default: stdout)-m, --markdown, --md
: Convert content to markdown-j, --json
: Output as JSON with both HTML and markdown content-p, --property <name>
: Extract a specific property (e.g., title, description, domain)--debug
: Enable debug mode-h, --help
: Display help for command
Parse a local HTML file (outputs HTML):
defuddle parse article.html
Parse a URL and convert to markdown:
defuddle parse https://example.com/article --md
Parse and get the full JSON response from Defuddle:
defuddle parse article.html --json
Save markdown output to a file:
defuddle parse article.html --md -o output.md
Extract specific properties:
# Get just the title
defuddle parse article.html --property title
# Get the description
defuddle parse article.html -p description
# Get the domain
defuddle parse article.html --property domain
# Install dependencies
npm install
# Build
npm run build
# Run in development mode
npm run dev