A Python tool that efficiently checks whether the files listed in a CSV input file exist as blobs in an Azure Blob Storage container.
Prerequisites:
- Python 3.7+
- Azure Storage account connection string
- CSV file with a 'FILENAME' column
Installation:
# Clone the repository
git clone https://github.com/dannotes/azure-blob-existence-checker.git
# Navigate to the project directory
cd azure-blob-existence-checker
# Create a virtual environment (optional but recommended)
python3 -m venv venv
source venv/bin/activate # On Windows, use `venv\Scripts\activate`
# Install the dependencies
pip install -r requirements.txt
# Alternatively, install the package directly from GitHub
pip install git+https://github.com/dannotes/azure-blob-existence-checker.git
Usage:
# Check which blobs exist
blob_checker.py "YOUR_CONNECTION_STRING" "CONTAINER_NAME" "path/to/input.csv"
# Check which blobs exist and export the results to CSV
blob_checker.py "YOUR_CONNECTION_STRING" "CONTAINER_NAME" "path/to/input.csv" -export csv
Your input CSV must have a column named 'FILENAME' containing the blob names to check.
Example:
FILENAME,OtherColumn1,OtherColumn2
file1.txt,Data1,Value1
file2.jpg,Data2,Value2
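To illustrate the input format above, here is a minimal sketch of reading the blob names from such a CSV. It is not taken from `blob_checker.py`; the function name `read_blob_names` is hypothetical, and skipping blank cells is an assumption about how empty rows should be treated.

```python
import csv
import io


def read_blob_names(csv_file, column="FILENAME"):
    """Return the non-empty values of `column` from an open CSV file object.

    Hypothetical helper for illustration; not part of the tool itself.
    """
    reader = csv.DictReader(csv_file)
    # Fail early with a clear message if the required column is absent.
    if reader.fieldnames is None or column not in reader.fieldnames:
        raise ValueError(f"CSV is missing required column {column!r}")
    # Assumption: blank or missing cells are skipped rather than reported.
    return [
        row[column].strip()
        for row in reader
        if row[column] and row[column].strip()
    ]


if __name__ == "__main__":
    sample = io.StringIO(
        "FILENAME,OtherColumn1,OtherColumn2\n"
        "file1.txt,Data1,Value1\n"
        "file2.jpg,Data2,Value2\n"
    )
    print(read_blob_names(sample))
```

The extra columns are ignored, so an existing spreadsheet export can be used as-is provided it has a 'FILENAME' header.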
Features:
- Concurrent blob existence checking
- Colorful terminal output
- Detailed summary of existing and non-existing blobs
- Optional CSV export of results
- Progress tracking
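As a rough idea of how concurrent existence checking can work, the sketch below runs the checks in a thread pool. It is an assumption about the general approach, not the tool's actual implementation: `check_blobs` and `make_azure_checker` are hypothetical names, and the worker count of 8 is arbitrary. The Azure calls used (`ContainerClient.from_connection_string`, `get_blob_client`, `BlobClient.exists`) come from the azure-storage-blob v12 SDK.

```python
from concurrent.futures import ThreadPoolExecutor


def check_blobs(names, exists, max_workers=8):
    """Map each blob name to True/False by calling `exists(name)` in a thread pool.

    `exists` is any callable that takes a blob name and returns a bool.
    """
    names = list(names)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map preserves input order, so results line up with names.
        results = list(pool.map(exists, names))
    return dict(zip(names, results))


def make_azure_checker(connection_string, container_name):
    """Build an `exists` callable backed by azure-storage-blob (hypothetical helper)."""
    # Imported lazily so the rest of the sketch runs without the SDK installed.
    from azure.storage.blob import ContainerClient

    container = ContainerClient.from_connection_string(
        connection_string, container_name
    )
    return lambda name: container.get_blob_client(name).exists()


if __name__ == "__main__":
    # Stand-in for a real container: only file1.txt "exists".
    present = {"file1.txt"}
    print(check_blobs(["file1.txt", "file2.jpg"], lambda n: n in present))
```

Threads suit this workload because each check spends most of its time waiting on the network. Passing `exists` in as a callable also lets the logic be tried out without an Azure account.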
Dependencies:
- azure-storage-blob
- tabulate
- colorama
Troubleshooting:
- Ensure your connection string is correct
- Verify container name matches exactly
- Check that the CSV file is properly formatted
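For the first check above, a quick local sanity test of the connection string can catch copy-paste mistakes before any network call is made. The sketch below is a heuristic, not part of the tool: the function names are hypothetical, and it assumes an account-key style string of the form `AccountName=...;AccountKey=...`. SAS-based or emulator connection strings use different keys and would be flagged by this check even though they are valid.

```python
def parse_connection_string(conn_str):
    """Split a 'Key=Value;Key=Value' connection string into a dict."""
    pairs = {}
    for part in conn_str.split(";"):
        if not part.strip():
            continue  # tolerate a trailing semicolon
        # Split on the first '=' only: account keys end with '=' padding.
        key, sep, value = part.partition("=")
        if not sep:
            raise ValueError("connection string has a segment without '='")
        pairs[key.strip()] = value.strip()
    return pairs


def missing_keys(conn_str, required=("AccountName", "AccountKey")):
    """Return the required keys that are absent or empty (assumed key set)."""
    pairs = parse_connection_string(conn_str)
    return [key for key in required if not pairs.get(key)]


if __name__ == "__main__":
    example = (
        "DefaultEndpointsProtocol=https;AccountName=demo;"
        "AccountKey=abc123==;EndpointSuffix=core.windows.net"
    )
    print(missing_keys(example))
```

An empty list means the expected keys are present; it does not prove that the credentials themselves are valid.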