org.apache.lucene.benchmark.utils
Class ExtractReuters
public class ExtractReuters
Splits the Reuters SGML documents into simple text files containing the Title, Date, Dateline, and Body.
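As a rough illustration of the kind of extraction described above, the following minimal sketch pulls the four fields out of a single SGML record with regular expressions. It is not Lucene's implementation. The helper class `ReutersFieldSketch`, its `extractFields` method, and the sample record are hypothetical, and the tag names `TITLE`, `DATE`, `DATELINE`, and `BODY` are assumed to match the Reuters SGML markup.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Lucene: extracts four fields from one SGML record.
class ReutersFieldSketch {

    // Field names in the order the class description lists them (assumed tag names).
    static final String[] FIELDS = {"TITLE", "DATE", "DATELINE", "BODY"};

    // Returns the trimmed text of each field, or an empty string when the tag is absent.
    static Map<String, String> extractFields(String sgml) {
        Map<String, String> out = new LinkedHashMap<>();
        for (String field : FIELDS) {
            // DOTALL lets a field span several lines; the reluctant quantifier
            // stops at the first matching closing tag.
            Pattern p = Pattern.compile("<" + field + ">(.*?)</" + field + ">", Pattern.DOTALL);
            Matcher m = p.matcher(sgml);
            out.put(field, m.find() ? m.group(1).trim() : "");
        }
        return out;
    }

    public static void main(String[] args) {
        // Made-up sample record for illustration only.
        String record = "<REUTERS>\n"
            + "<DATE>26-FEB-1987 15:01:01.79</DATE>\n"
            + "<TEXT>\n"
            + "<TITLE>SAMPLE TITLE</TITLE>\n"
            + "<DATELINE>    NEW YORK, Feb 26 - </DATELINE><BODY>First line.\n"
            + "Second line.\n"
            + "</BODY></TEXT>\n"
            + "</REUTERS>";
        for (Map.Entry<String, String> e : extractFields(record).entrySet()) {
            System.out.println(e.getKey() + ": " + e.getValue());
        }
    }
}
```

A regular expression per field keeps the sketch short. It does not decode SGML character entities or handle multiple records per file, both of which a real extractor would likely need.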
void | extract()
protected void | extractFile(File sgmFile) - Override if you wish to change what is extracted
static void | main(String[] args)
ExtractReuters
public ExtractReuters(File reutersDir, File outputDir)
extract
public void extract()
extractFile
protected void extractFile(File sgmFile)
Override if you wish to change what is extracted
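The override hook can be sketched as follows. So that the example compiles without the Lucene benchmark jar, it uses a stand-in class, `ExtractorStandIn`, that copies only the signatures documented on this page. Its method bodies, including the assumption that `extract()` passes every `.sgm` file in `reutersDir` to `extractFile`, are illustrative guesses rather than the library's code. `NameOnlyExtractor`, `OverrideSketch`, and `demo()` are likewise hypothetical names.

```java
import java.io.File;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.util.ArrayList;
import java.util.List;

// Hypothetical driver: builds a temporary input directory and runs the subclass.
class OverrideSketch {
    static List<String> demo() {
        try {
            File in = Files.createTempDirectory("reuters-in").toFile();
            File out = Files.createTempDirectory("reuters-out").toFile();
            new File(in, "sample.sgm").createNewFile();
            new File(in, "notes.txt").createNewFile(); // skipped by the stand-in's filter
            NameOnlyExtractor extractor = new NameOnlyExtractor(in, out);
            extractor.extract();
            return extractor.seen;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(demo());
    }
}

// Stand-in mirroring the documented signatures of ExtractReuters.
// The bodies are assumptions for illustration, not Lucene's implementation.
class ExtractorStandIn {
    protected final File reutersDir;
    protected final File outputDir;

    ExtractorStandIn(File reutersDir, File outputDir) {
        this.reutersDir = reutersDir;
        this.outputDir = outputDir;
    }

    // Assumed behavior: hand every .sgm file in reutersDir to extractFile.
    public void extract() {
        File[] files = reutersDir.listFiles((dir, name) -> name.endsWith(".sgm"));
        if (files == null) {
            return; // reutersDir is missing or not a directory
        }
        for (File sgmFile : files) {
            extractFile(sgmFile);
        }
    }

    // Default extraction would go here; left empty in this sketch.
    protected void extractFile(File sgmFile) {
    }
}

// Hypothetical subclass: changes what is extracted by recording file names only.
class NameOnlyExtractor extends ExtractorStandIn {
    final List<String> seen = new ArrayList<>();

    NameOnlyExtractor(File reutersDir, File outputDir) {
        super(reutersDir, outputDir);
    }

    @Override
    protected void extractFile(File sgmFile) {
        seen.add(sgmFile.getName());
    }
}
```

Against the real class, the same pattern would presumably apply by extending `ExtractReuters` instead of the stand-in and calling `extract()` on the subclass instance.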
main
public static void main(String[] args)
Copyright © 2000-2007 Apache Software Foundation. All Rights Reserved.