Crawl a single CSDN article, or every article in a CSDN category column, and convert it to Markdown (Jianshu and Zhihu are also supported)

oneJue/article2md

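Crawl blog articles and save them as Markdown

(Clean version with extraneous page content removed; supports multiple platforms such as CSDN, Jianshu, and Zhihu)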


Two methods to choose from

article2md(one).py: Method one

article2md(two).py: Method two (recommended)


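article2批量版.py (batch version): converts every article under a given CSDN column to Markdown

It is intended for crawling a whole series of articles, such as the posts collected under one column.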
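A batch run needs the list of article URLs in a column before any conversion can start. The sketch below shows one way to collect them from a column page that has already been downloaded. It is not taken from article2批量版.py: the function name `extract_article_links` is hypothetical, and the URL pattern `https://blog.csdn.net/<user>/article/details/<id>` is an assumption about how CSDN article links look.

```python
import re

from bs4 import BeautifulSoup

# Assumed shape of a CSDN article URL: https://blog.csdn.net/<user>/article/details/<id>
ARTICLE_LINK = re.compile(r"^https://blog\.csdn\.net/[^/]+/article/details/\d+$")


def extract_article_links(column_html: str) -> list[str]:
    """Return the unique article URLs found in a column page, in page order."""
    soup = BeautifulSoup(column_html, "html.parser")
    links: list[str] = []
    for anchor in soup.find_all("a", href=True):
        # Drop tracking parameters so the same article is not listed twice.
        href = anchor["href"].split("?")[0]
        if ARTICLE_LINK.match(href) and href not in links:
            links.append(href)
    return links
```

Each URL returned this way could then be passed to whichever single-article method is in use.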


Dependencies

Install with pip

pip install html2text
pip install lxml
pip install requests
pip install beautifulsoup4

  • Method one:

Approach: fetch the target page's HTML, then convert it to Markdown with the html2text module


  • Method two:

Approach: call the conversion feature of an "HTML/URL To Markdown" website to do the conversion
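The README does not say which conversion site is used or how it is called, so the sketch below only shows the general shape of such a request. Everything service-specific is a placeholder: the endpoint `https://converter.example/api` and the query parameter `url` are hypothetical and must be replaced with whatever the chosen service documents. It uses only the standard library.

```python
import urllib.parse
import urllib.request

# Hypothetical endpoint: replace with the real address of the conversion service.
CONVERTER_ENDPOINT = "https://converter.example/api"


def build_conversion_url(article_url: str, endpoint: str = CONVERTER_ENDPOINT) -> str:
    """Build the request URL; the parameter name "url" is an assumption."""
    query = urllib.parse.urlencode({"url": article_url})
    return f"{endpoint}?{query}"


def convert_via_service(article_url: str, endpoint: str = CONVERTER_ENDPOINT) -> str:
    """Ask the remote service to convert the article and return its response body,
    assumed here to be UTF-8 Markdown text."""
    request_url = build_conversion_url(article_url, endpoint)
    with urllib.request.urlopen(request_url, timeout=30) as response:
        return response.read().decode("utf-8")
```

Compared with Method one, this delegates the HTML cleanup to the remote service, at the cost of depending on that service being available.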


Related links

[python] Save blog articles as Markdown
