Local Text Scraper is a Python application that allows you to scrape text from files in a specified directory and combine them into a single output file. It features a user-friendly Gradio interface for easy interaction.
- Scrape text from files with a specific extension in a given directory
- Combine all scraped text into a single output file
- User-friendly Gradio interface for easy interaction
- Option to specify custom output directory or use default
-
Clone this repository:
git clone https://github.com/tekierz/LocalTextScraper.git
-
Navigate to the project directory:
cd LocalTextScraper
-
Install the required dependencies:
pip install gradio
-
Run the script:
python main.py
-
The Gradio interface will launch in your default web browser.
-
Enter the following information in the interface:
- Input Directory: The directory containing the files you want to scrape
- File Extension: The extension of the files you want to scrape (e.g., .txt)
- Output Directory (optional): The directory where you want to save the combined output file. If left blank, it will use a default "Outputs" folder in the same directory as the script.
-
Click "Submit" to start the scraping process.
-
The script will display a message indicating where the combined output file has been saved.
Contributions are welcome! Please feel free to submit a Pull Request.