veronicaads/arjuna-pyconid2021

Arjuna: Generate Indonesian Poets and Poems Using a Python-Based NLP Model

Arjuna is an NLP project that generates poets and poems in Indonesian (Bahasa Indonesia). This repository contains the source code used in the PyCon ID 2021 talk by @miqdude and @veronicaads.

This repository consists of two main folders: scraper and model.

Scraper

This folder contains a Python script that scrapes https://www.kompas.id/kategori/sastra/ to build the dataset for the model.

You need to register at https://www.kompas.id before using this scraper, because the email and password parameters must be filled in with the credentials of a registered account.

Please install all the required libraries from the requirements folder: pip install -r requirements.txt

python kompas_sastra_scraper.py --user_email [email protected] --password xxxxxxxxxx --depth 5
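
For readers who want to see how a command line like the one above can be handled, here is a minimal sketch using argparse from the Python standard library. Only the three flag names (--user_email, --password, --depth) come from this README. The function names build_parser and parse_args, the help texts, and the choice to treat --depth as an integer are assumptions made for illustration; the actual kompas_sastra_scraper.py may parse its arguments differently.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser for the three flags shown in the README command.

    Hypothetical helper: the real script may be organized differently.
    """
    parser = argparse.ArgumentParser(
        description="Sketch of a CLI for a kompas.id sastra scraper"
    )
    # Credentials of a registered kompas.id account.
    parser.add_argument("--user_email", required=True,
                        help="email of the registered account")
    parser.add_argument("--password", required=True,
                        help="password of the registered account")
    # Assumption: depth is an integer; its exact meaning is not documented here.
    parser.add_argument("--depth", type=int, required=True,
                        help="scraping depth (integer)")
    return parser


def parse_args(argv=None) -> argparse.Namespace:
    """Parse argv (or sys.argv[1:] when argv is None)."""
    return build_parser().parse_args(argv)
```

Calling parse_args(["--user_email", "someone@example.com", "--password", "secret", "--depth", "5"]) returns a namespace whose user_email, password, and depth attributes hold the supplied values, with depth converted to an int.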

Model
