Skip to content

Latest commit

 

History

History
95 lines (87 loc) · 2.02 KB

README.md

File metadata and controls

95 lines (87 loc) · 2.02 KB


zetanize
zetanize

HTML Form Parser For Humans

Introduction

It's very easy to make HTTP requests in python, thanks to urllib and requests. However, there was no way to submit HTML forms on the go, well now there is.

Documentation

from zetanize import zetanize
forms = zetanize(html)

Well that's it! Just feed zetanize a HTML document and it will give you a dict of actionable form data.
Let's parse https://google.com for getting familiar:

from requests import get
from zetanize import zetanize

html = get('https://google.com').text
forms = zetanize(html)

Here's the output:

{
  "0": {
    "action": "/search", 
    "inputs": [
      {
        "type": "hidden", 
        "name": "ie", 
        "value": "ISO-8859-1"
      }, 
      {
        "type": "hidden", 
        "name": "hl", 
        "value": "en-IN"
      }, 
      {
        "type": "hidden", 
        "name": "source", 
        "value": "hp"
      }, 
      {
        "type": "hidden", 
        "name": "biw", 
        "value": ""
      }, 
      {
        "type": "hidden", 
        "name": "bih", 
        "value": ""
      }, 
      {
        "type": "", 
        "name": "q", 
        "value": ""
      }, 
      {
        "type": "submit", 
        "name": "btnG", 
        "value": "Google Search"
      }, 
      {
        "type": "submit", 
        "name": "btnI", 
        "value": "I"
      }, 
      {
        "type": "hidden", 
        "name": "gbv", 
        "value": "1"
      }
    ], 
    "method": "get"
  }
}