erlitx/cid_parser

CID Parser

./bulk_parse.py

Scrapes one or more chipdip.ru product pages and writes a CSV to standard output with the following columns:

  1. URL of the page being scraped
  2. JSON of the product description found
  3. Markdown snippet derived from the description

Example usage:

# Scrape single page
./bulk_parse.py https://www.chipdip.ru/product0/8003076409

# Scrape multiple and save to file
./bulk_parse.py https://www.chipdip.ru/product0/8003076409 https://www.chipdip.ru/product/spu01m-05 > out.csv

# Scrape multiple from file (one URL per line) and save to file
xargs ./bulk_parse.py < examples/product_page_list.txt > out.csv
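
Reading the output back:

A saved out.csv can be loaded with Python's standard csv and json modules. The following is a minimal sketch, not part of this repository: read_rows and its has_header parameter are hypothetical names, and it assumes the three columns appear in the order listed above, that the second column is valid JSON, and that the file has no header row unless has_header=True is passed.

```python
import csv
import io
import json


def read_rows(fileobj, has_header=False):
    """Yield (url, description, markdown) tuples from bulk_parse.py CSV output.

    Assumption: columns are URL, JSON description, Markdown snippet, in that
    order. Whether the real output has a header row is not documented, so it
    is left to the caller via has_header.
    """
    reader = csv.reader(fileobj)
    if has_header:
        next(reader, None)  # discard the header row if the file has one
    for row in reader:
        if len(row) < 3:
            continue  # skip blank or short lines
        url, description_json, markdown = row[:3]
        yield url, json.loads(description_json), markdown


if __name__ == "__main__":
    # Hypothetical row built in memory; this is not real scraper output.
    sample = io.StringIO()
    csv.writer(sample).writerow(
        [
            "https://www.chipdip.ru/product/spu01m-05",
            json.dumps({"name": "example"}),
            "# example",
        ]
    )
    sample.seek(0)
    for url, description, markdown in read_rows(sample):
        print(url, description, markdown)
```

To read a real file, pass an open handle, for example open("out.csv", newline="", encoding="utf-8"). The UTF-8 encoding is an assumption here and may need adjusting to match the script's actual output.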
