Use Python to investigate the ScienceDirect database

This folder of codes is crafted by Zihan Zhang for his lab internship at McDevitt Research Group, NYU College of Dentistry. The purpose is to get access to information from ScienceDirect database in using a web crawler. The codes are primary python-based and a few Matlab and SQL involved.

The heatmap_create section is used to generate a visual presentation of the total found results of different keywords combinations. The download_pdf section is used to download all pdfs relevant to this keyword combinations and store them into different folders. The database_create section is used to extract important information related to each article, including related keywords, DOI, authors, published date, etc., and store them into the MySQL database. The analyze_pdf section is used to transform the pdf file into an editable and searchable JSON file so that the users could trace for specific keywords and go over through the contents.

Notice that the codes in each folder are more or less similar to each other. That is for readers' convenience to treat them as separate projects for future reference.

I also post the interim presentation of the data extraction team to better introduce my role and contributions to the trauma project. The final well-organized program can be found at here.

Name		Name	Last commit message	Last commit date
Latest commit History 81 Commits
analyze_pdf		analyze_pdf
database_create		database_create
download_pdf		download_pdf
heatmap_create		heatmap_create
Progress Presentation [08_07_20].pdf		Progress Presentation [08_07_20].pdf
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Use Python to investigate the ScienceDirect database

About

Releases

Packages

Languages

StevenZhang0116/ScienceDirectWebCrawler

Folders and files

Latest commit

History

Repository files navigation

Use Python to investigate the ScienceDirect database

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages