Interface for the DBLP-Parser to save data into an Accumulo instance as a Graphulo sparse matrix. Implementation of algorithms to calculate weakly and strongly connected components and analysis. Written for the Big Data Praktikum at the University of Leipzig.
Maven, Gradle, Accumulo
Tested on Ubuntu 16.04
- Run install_acc.sh and execute the manual commands
- Run graphulo.sh
- Download DBLP-Parser
- Run
gradle build
in DBLP-Parser - Run build_and_copy.sh
- Download the DBLP data and add it to resources
Infrastructure contains classes to connect to Accumulo and an abstraction for "Graphulo-formed" Accumulo-Entries.
To use the DBLP-data, you can implement a DBLP-Processor, check this and this.
Calculate components (really slow) using this. For analysis use this.