SlackBuilds Repository

15.0 > Libraries > BeautifulSoup (3.2.1)

Beautiful Soup is a Python HTML/XML parser designed for quick
turnaround projects like screen-scraping. Three features make it

1. Beautiful Soup won't choke if you give it bad markup. It yields a
parse tree that makes approximately as much sense as your original
document. This is usually good enough to collect the data you need
and run away.

2. Beautiful Soup provides a few simple methods and Pythonic idioms
for navigating, searching, and modifying a parse tree: a toolkit
for dissecting a document and extracting what you need. You don't
have to create a custom parser for each application.

3. Beautiful Soup automatically converts incoming documents to Unicode
and outgoing documents to UTF-8. You don't have to think about
encodings, unless the document doesn't specify an encoding and
Beautiful Soup can't autodetect one. Then you just have to
specify the original encoding.

Beautiful Soup parses anything you give it, and does the tree
traversal stuff for you. You can tell it "Find all the links", or
"Find all the links of class externalLink", or "Find all the links
whose urls match "", or "Find the table heading that's got bold
text, then give me that text."

Valuable data that was once locked up in poorly-designed websites is
now within your reach. Projects that would have taken hours take only
minutes with Beautiful Soup.

Maintained by: Marco Bonetti
Keywords: network,screen scraping,parser,parsing
ChangeLog: BeautifulSoup


Source Downloads:
BeautifulSoup-3.2.1.tar.gz (5ad1a8550a19bfc945baac23eb8293ed)

Download SlackBuild:
BeautifulSoup.tar.gz.asc (FAQ)

(the SlackBuild does not include the source)

Validated for Slackware 15.0

See our HOWTO for instructions on how to use the contents of this repository.

Access to the repository is available via:
ftp git cgit http rsync

© 2006-2023 Project. All rights reserved.
Slackware® is a registered trademark of Patrick Volkerding
Linux® is a registered trademark of Linus Torvalds