针对wordpress网站爬取文章,图片,评论等数据。

liuyuqi-dellpc 587fa6fc7d 测试项目 6 years ago
conf d4599f23f2 go on 6 years ago
utils d4599f23f2 go on 6 years ago
.gitignore d4599f23f2 go on 6 years ago
LICENSE bcd574bd93 Initial commit 6 years ago
README.md d4599f23f2 go on 6 years ago
crawl-menshijian8-images-splinter.py d4599f23f2 go on 6 years ago
crawl-menshijian8-images.py d4599f23f2 go on 6 years ago
crawl-xx-xmlrpc.py d4599f23f2 go on 6 years ago
db-manager.py d4599f23f2 go on 6 years ago
main.py d4599f23f2 go on 6 years ago
main2.py d4599f23f2 go on 6 years ago
test-phantomjs.py d4599f23f2 go on 6 years ago
test-re.ipynb 587fa6fc7d 测试项目 6 years ago
test-re.py 587fa6fc7d 测试项目 6 years ago
test-selenium.py d4599f23f2 go on 6 years ago
test-splinter.py d4599f23f2 go on 6 years ago
test.py 587fa6fc7d 测试项目 6 years ago
test2.py 587fa6fc7d 测试项目 6 years ago
test3.py 587fa6fc7d 测试项目 6 years ago
test4.py 587fa6fc7d 测试项目 6 years ago

README.md

crawl-wordpress

针对wordpress网站爬取文章,图片,评论等数据。

    sudo pip3 install python-wordpress-xmlrpc
    sudo pip3 install python-crontab