Class JoiningSequenceReader<C extends Compound>
java.lang.Object
org.biojava3.core.sequence.storage.JoiningSequenceReader<C>
- Type Parameters:
C
- Tyoe of compound to hold
- All Implemented Interfaces:
Iterable<C>
,Accessioned
,ProxySequenceReader<C>
,Sequence<C>
,SequenceReader<C>
public class JoiningSequenceReader<C extends Compound>
extends Object
implements ProxySequenceReader<C>
This reader actually proxies onto multiple types of sequence in order to allow a number of
sequence objects to act as if they are one sequence. The code takes in any number of sequences,
records the minimum and maximum bounds each sequence covers with respect to 1 position indexing
and then binary searches these when a position is requested. Because of this 0 length Sequences
are excluded during construction.
Performance is not as good as if you are using a flat sequence however the speed of lookup is
more than adaquate for most situations. Using the iterator gives the best performance as this
does not rely on the binary search mechanism instead iterating through each sequence in turn.
- Author:
- ayates
-
Constructor Summary
ConstructorDescriptionJoiningSequenceReader
(List<Sequence<C>> sequences) Allows creation of the store from List<Sequence>. JoiningSequenceReader
(CompoundSet<C> compoundSet, List<Sequence<C>> sequences) JoiningSequenceReader
(CompoundSet<C> compoundSet, Sequence<C>... sequences) JoiningSequenceReader
(Sequence<C>... sequences) Allows creation of the store from Vargs Sequenceobjects. -
Method Summary
Modifier and TypeMethodDescriptionint
countCompounds
(C... compounds) Returns the number of times we found a compound in the SequenceReturns the AccessionID this location is currently bound withReturns the Sequence as a List of compoundsgetCompoundAt
(int position) Returns the Compound at the given biological indexGets the compound set used to back this Sequenceint
getIndexOf
(C compound) Scans through the Sequence looking for the first occurrence of the given compoundDoes the right thing to get the inverse of the current Sequence.int
getLastIndexOf
(C compound) Scans through the Sequence looking for the last occurrence of the given compoundint
Returns the length of the SequenceReturns the String representation of the SequencegetSubSequence
(Integer start, Integer end) Returns a portion of the sequence from the different positions.iterator()
Iterator implementation which attempts to move through the 2D structure attempting to skip onto the next sequence as & when it is asked tovoid
setCompoundSet
(CompoundSet<C> compoundSet) void
setContents
(String sequence) Methods inherited from class java.lang.Object
equals, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface java.lang.Iterable
forEach, spliterator
-
Constructor Details
-
JoiningSequenceReader
Allows creation of the store from Vargs Sequenceobjects. CompoundSet defaults to the first element of the array (assuming there are elements available during construction otherwise we will throw an illegal state exception). -
JoiningSequenceReader
Allows creation of the store from List<Sequence>. CompoundSet defaults to the first element of the List (assuming there are elements available during construction otherwise we will throw an illegal state exception). -
JoiningSequenceReader
-
JoiningSequenceReader
-
-
Method Details
-
getCompoundAt
Description copied from interface:Sequence
Returns the Compound at the given biological index- Specified by:
getCompoundAt
in interfaceSequence<C extends Compound>
- Parameters:
position
- Biological index (1 to n)- Returns:
- Compound at the specified position
-
getCompoundSet
Description copied from interface:Sequence
Gets the compound set used to back this Sequence- Specified by:
getCompoundSet
in interfaceSequence<C extends Compound>
-
getLength
public int getLength()Description copied from interface:Sequence
Returns the length of the Sequence -
iterator
Iterator implementation which attempts to move through the 2D structure attempting to skip onto the next sequence as & when it is asked to -
setCompoundSet
- Specified by:
setCompoundSet
in interfaceSequenceReader<C extends Compound>
-
setContents
- Specified by:
setContents
in interfaceSequenceReader<C extends Compound>
-
countCompounds
Description copied from interface:Sequence
Returns the number of times we found a compound in the Sequence- Specified by:
countCompounds
in interfaceSequence<C extends Compound>
- Parameters:
compounds
- Vargs of the compounds to count- Returns:
- Number of times a compound was found
-
getAccession
Description copied from interface:Accessioned
Returns the AccessionID this location is currently bound with- Specified by:
getAccession
in interfaceAccessioned
- Throws:
UnsupportedOperationException
-
getAsList
Description copied from interface:Sequence
Returns the Sequence as a List of compounds -
getIndexOf
Description copied from interface:Sequence
Scans through the Sequence looking for the first occurrence of the given compound- Specified by:
getIndexOf
in interfaceSequence<C extends Compound>
- Parameters:
compound
- Compounds to look for- Returns:
- Index of the first position of the compound in the sequence (1 to n)
-
getLastIndexOf
Description copied from interface:Sequence
Scans through the Sequence looking for the last occurrence of the given compound- Specified by:
getLastIndexOf
in interfaceSequence<C extends Compound>
- Parameters:
compound
- Compounds to look for- Returns:
- Index of the last position of the compound in the sequence (1 to n)
-
getSequenceAsString
Description copied from interface:Sequence
Returns the String representation of the Sequence- Specified by:
getSequenceAsString
in interfaceSequence<C extends Compound>
-
getSubSequence
Description copied from interface:Sequence
Returns a portion of the sequence from the different positions. This is indexed from 1- Specified by:
getSubSequence
in interfaceSequence<C extends Compound>
- Parameters:
start
- Biological index start; must be greater than 0end
- Biological end; must be less than length + 1- Returns:
- A SequenceView of the offset
-
getInverse
Description copied from interface:Sequence
Does the right thing to get the inverse of the current Sequence. This means either reversing the Sequence and optionally complementing the Sequence.- Specified by:
getInverse
in interfaceSequence<C extends Compound>
-