WARC Input and Output Formats for Hadoop

Java library for working with WARC (Web Archive) files in Hadoop MapReduce

Homepage POM file JAR file Javadoc
'com.martinkl.warc:warc-hadoop:0.1.0'

Dependencies

Compile dependencies

Test dependencies