NAME

    Catmandu::HTML - Modules for handling HTML data within the Catmandu
    framework

SYNOPSIS

    Command line client catmandu:

      catmandu convert HTML to JSON < ex/test.html > data/test.json
    
      catmandu convert JSON to HTML < data/test.json > ex/test.html
    
      # Keep the text
      curl -s http://librecat.org |
        catmandu convert HTML to Text --fix "html_text()"
    
      # Filter out the anchors
      curl -s http://librecat.org |
        catmandu convert HTML to Text --fix "html_filter_tag(a); html_text(join:\"\n\")"
    
      # Filter out the meta tags as YAML
      curl -s http://librecat.org |
        catmandu convert HTML to YAML --fix "html_filter_tag(meta, group_by:name)"

    See documentation of modules for more examples.

AVAILABLE MODULES

    Catmandu::Exporter::HTML

      Serialize HTML data

    Catmandu::Importer::HTML

      Parse HTML data

    Catmandu::Fix::html_filter_tag

      Filter HTML by name and optional group them

    Catmandu::Fix::html_filter_type

      Filter HTML by type

    Catmandu::Fix::html_text

      Select only the literal text values from the HTML

SEE ALSO

    This module is based on Catmandu

AUTHORS

    Patrick Hochstenbach, patrick.hochstenbach at ugent.be

COPYRIGHT AND LICENSE

    The Perl software is copyright (c) 2018 by Patrick Hochstenbach. This
    is free software; you can redistribute it and/or modify it under the
    same terms as the Perl 5 programming language system itself.