uk.ac.man.documentparser.input.util
Class Splitter
java.lang.Object
uk.ac.man.documentparser.input.util.Splitter
- All Implemented Interfaces:
- java.lang.Iterable<Document>, java.util.Iterator<Document>, DocumentIterator
public class Splitter
- extends java.lang.Object
- implements DocumentIterator
Adaptor Class for splitting documents on x number of sentences
- Author:
- Martin
Constructor Summary |
Splitter(DocumentIterator documents,
int sentencesPerSplit)
Create a documentiterator adaptor, which will return documents, with a maximum of "sentencesPerSplit" sentences
Uses martin.common.SentenceSplitter. |
Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
sentencesPerSplit
private int sentencesPerSplit
documents
private DocumentIterator documents
currentTexts
private java.util.LinkedList<java.lang.String> currentTexts
currentIDs
private java.util.LinkedList<java.lang.String> currentIDs
Splitter
public Splitter(DocumentIterator documents,
int sentencesPerSplit)
- Create a documentiterator adaptor, which will return documents, with a maximum of "sentencesPerSplit" sentences
Uses martin.common.SentenceSplitter.
- Parameters:
documents
- sentencesPerSplit
-
getNext
private void getNext()
skip
public void skip()
- Specified by:
skip
in interface DocumentIterator
hasNext
public boolean hasNext()
- Specified by:
hasNext
in interface java.util.Iterator<Document>
next
public Document next()
- Specified by:
next
in interface java.util.Iterator<Document>
remove
public void remove()
- Specified by:
remove
in interface java.util.Iterator<Document>
iterator
public java.util.Iterator<Document> iterator()
- Specified by:
iterator
in interface java.lang.Iterable<Document>