Section: User Commands (1)
Updated: January 2021
Page Index


python3-html2text - manual page for python3-html2text 2020.1.16  


usage: python3-html2text [-h] [--default-image-alt DEFAULT_IMAGE_ALT]
[--pad-tables] [--no-wrap-links] [--wrap-list-items]
[--ignore-emphasis] [--reference-links] [--ignore-links] [--protect-links] [--ignore-images] [--images-as-html] [--images-to-alt] [--images-with-size] [-g] [-d] [-e] [-b BODY_WIDTH] [-i LIST_INDENT] [-s] [--escape-all] [--bypass-tables] [--ignore-tables] [--single-line-break] [--unicode-snob] [--no-automatic-links] [--no-skip-internal-links] [--links-after-para] [--mark-code] [--decode-errors DECODE_ERRORS] [--open-quote OPEN_QUOTE] [--close-quote CLOSE_QUOTE] [--version] [filename] [encoding]

positional arguments:

filename encoding

optional arguments:

-h, --help
show this help message and exit
--default-image-alt DEFAULT_IMAGE_ALT
The default alt string for images with missing ones
pad the cells to equal column width in tables
don't wrap links during conversion
wrap list items during conversion
don't include any formatting for emphasis
use reference style links instead of inline links
don't include any formatting for links
protect links from line breaks surrounding them with angle brackets
don't include any formatting for images
Always write image tags as raw html; preserves `height`, `width` and `alt` if possible.
Discard image data, only keep alt text
Write image tags with height and width attrs as raw html to retain dimensions
-g, --google-doc
convert an html-exported Google Document
-d, --dash-unordered-list
use a dash rather than a star for unordered list items
-e, --asterisk-emphasis
use an asterisk rather than an underscore for emphasized text
-b BODY_WIDTH, --body-width BODY_WIDTH
number of characters per output line, 0 for no wrap
-i LIST_INDENT, --google-list-indent LIST_INDENT
number of pixels Google indents nested lists
-s, --hide-strikethrough
hide strike-through text. only relevant when -g is specified as well
Escape all special characters. Output is less readable, but avoids corner case formatting issues.
Format tables in HTML rather than Markdown syntax.
Ignore table-related tags (table, th, td, tr) while keeping rows.
Use a single line break after a block element rather than two line breaks. NOTE: Requires --body-width=,0/
Use unicode throughout document
Do not use automatic links wherever applicable
Do not skip internal links
Put links after each paragraph instead of document
Mark program code blocks with [code]...[/code]
--decode-errors DECODE_ERRORS
What to do in case of decode errors.'ignore', 'strict' and 'replace' are acceptable values
--open-quote OPEN_QUOTE
The character used to open quotes
--close-quote CLOSE_QUOTE
The character used to close quotes
show program's version number and exit



positional arguments:
optional arguments: