Skip to content
This repository has been archived by the owner on Dec 4, 2018. It is now read-only.

x4v13r64/CMSpyder

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CircleCI Code Issues Code Health codecov

Project

Distributed www crawler/scrapper with:

  • CMS detection
    • Plugin-based architecture
    • Version and plugin detection
    • Maintains historic versions
  • Subdomain light bruteforcer (common subdomains e.g. blog., store.)
  • Subdomain scrapper (1-deep)
  • Subdirectory light bruteforcer (common directories e.g. /blog, /wp)
  • Web interface for results and monitoring
  • Static website to view results
    • Updated with daily statistics

Milestones

  1. Prototype django application using celery
  2. Initial architecture for plugin-based detection
  3. Initial bakend configuration
  4. Third-party CI integration
  5. Second iteration
  6. Move GitHub site to Jekyll
  7. Container (docker) build
  8. Third iteration
    1. Subdomain discovery
    2. Subdirectory discovery
    3. Limit crawling of subnets (e.g. 5 min wait per /24)
    4. Bug fixes
    5. Test cases
  9. Distributed architecture
    1. MongoDB cluster
  10. Network hardening
  11. Better Jekyll website
    1. Statistics, graphs, text search
  12. Elasticsearch integration
  13. TBD

Running

  • Start the RabbitMQ server $ rabbitmq-server
  • Start the worker process $ DJANGO_SETTINGS_MODULE='cmspyder.settings.dev' celery -A cmspyder worker --concurrency=50 --pool=eventlet -Q cmspyder_detect_cms_queue, cmspyder_discover_domains_queue
  • Start the master process $ python manage.py runserver 0.0.0.0:8080 --settings=cmspyder.settings.dev

About

distributed www CMS spyder

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published