ICU 65.1
65.1
|
The CollationElementIterator class is used as an iterator to walk through each character of an international string. More...
#include <coleitr.h>
Public Types | |
enum | { NULLORDER = (int32_t)0xffffffff } |
Public Member Functions | |
CollationElementIterator (const CollationElementIterator &other) | |
Copy constructor. More... | |
virtual | ~CollationElementIterator () |
Destructor. More... | |
UBool | operator== (const CollationElementIterator &other) const |
Returns true if "other" is the same as "this". More... | |
UBool | operator!= (const CollationElementIterator &other) const |
Returns true if "other" is not the same as "this". More... | |
void | reset (void) |
Resets the cursor to the beginning of the string. More... | |
int32_t | next (UErrorCode &status) |
Gets the ordering priority of the next character in the string. More... | |
int32_t | previous (UErrorCode &status) |
Get the ordering priority of the previous collation element in the string. More... | |
int32_t | getMaxExpansion (int32_t order) const |
Return the maximum length of any expansion sequences that end with the specified comparison order. More... | |
int32_t | strengthOrder (int32_t order) const |
Gets the comparison order in the desired strength. More... | |
void | setText (const UnicodeString &str, UErrorCode &status) |
Sets the source string. More... | |
void | setText (CharacterIterator &str, UErrorCode &status) |
Sets the source string. More... | |
int32_t | getOffset (void) const |
Gets the offset of the currently processed character in the source string. More... | |
void | setOffset (int32_t newOffset, UErrorCode &status) |
Sets the offset of the currently processed character in the source string. More... | |
virtual UClassID | getDynamicClassID () const |
ICU "poor man's RTTI", returns a UClassID for the actual class. More... | |
UCollationElements * | toUCollationElements () |
const UCollationElements * | toUCollationElements () const |
Public Member Functions inherited from icu::UObject | |
virtual | ~UObject () |
Destructor. More... | |
Static Public Member Functions | |
static int32_t | primaryOrder (int32_t order) |
Gets the primary order of a collation order. More... | |
static int32_t | secondaryOrder (int32_t order) |
Gets the secondary order of a collation order. More... | |
static int32_t | tertiaryOrder (int32_t order) |
Gets the tertiary order of a collation order. More... | |
static UBool | isIgnorable (int32_t order) |
Checks if a comparison order is ignorable. More... | |
static UClassID | getStaticClassID () |
ICU "poor man's RTTI", returns a UClassID for this class. More... | |
static CollationElementIterator * | fromUCollationElements (UCollationElements *uc) |
static const CollationElementIterator * | fromUCollationElements (const UCollationElements *uc) |
Friends | |
class | RuleBasedCollator |
class | UCollationPCE |
The CollationElementIterator class is used as an iterator to walk through each character of an international string.
Use the iterator to return the ordering priority of the positioned character. The ordering priority of a character, which we refer to as a key, defines how a character is collated in the given collation object. For example, consider the following in Slovak and in traditional Spanish collation:
"ca" -> the first key is key('c') and second key is key('a'). "cha" -> the first key is key('ch') and second key is key('a').
And in German phonebook collation,
"æb"-> the first key is key('a'), the second key is key('e'), and the third key is key('b').
The key of a character, is an integer composed of primary order(short), secondary order(char), and tertiary order(char). Java strictly defines the size and signedness of its primitive data types. Therefore, the static functions primaryOrder(), secondaryOrder(), and tertiaryOrder() return int32_t to ensure the correctness of the key value.
Example of the iterator usage: (without error checking)
void CollationElementIterator_Example(){UnicodeString str = "This is a test";UErrorCode success = U_ZERO_ERROR;RuleBasedCollator* rbc =(RuleBasedCollator*) RuleBasedCollator::createInstance(success);rbc->createCollationElementIterator( str );int32_t order = c->next(success);c->reset();order = c->previous(success);delete c;delete rbc;}
The method next() returns the collation order of the next character based on the comparison level of the collator. The method previous() returns the collation order of the previous character based on the comparison level of the collator. The Collation Element Iterator moves only in one direction between calls to reset(), setOffset(), or setText(). That is, next() and previous() can not be inter-used. Whenever previous() is to be called after next() or vice versa, reset(), setOffset() or setText() has to be called first to reset the status, shifting pointers to either the end or the start of the string (reset() or setText()), or the specified position (setOffset()). Hence at the next call of next() or previous(), the first or last collation order, or collation order at the spefcifieid position will be returned. If a change of direction is done without one of these calls, the result is undefined.
The result of a forward iterate (next()) and reversed result of the backward iterate (previous()) on the same string are equivalent, if collation orders with the value 0 are ignored. Character based on the comparison level of the collator. A collation order consists of primary order, secondary order and tertiary order. The data type of the collation order is int32_t.
Note, CollationElementIterator should not be subclassed.
anonymous enum |
icu::CollationElementIterator::CollationElementIterator | ( | const CollationElementIterator & | other | ) |
|
virtual |
Destructor.
|
inlinestatic |
|
inlinestatic |
|
virtual |
ICU "poor man's RTTI", returns a UClassID for the actual class.
Reimplemented from icu::UObject.
int32_t icu::CollationElementIterator::getMaxExpansion | ( | int32_t | order | ) | const |
Return the maximum length of any expansion sequences that end with the specified comparison order.
order | a collation order returned by previous or next. |
int32_t icu::CollationElementIterator::getOffset | ( | void | ) | const |
Gets the offset of the currently processed character in the source string.
|
static |
ICU "poor man's RTTI", returns a UClassID for this class.
|
inlinestatic |
int32_t icu::CollationElementIterator::next | ( | UErrorCode & | status | ) |
Gets the ordering priority of the next character in the string.
status | the error code status. |
UBool icu::CollationElementIterator::operator!= | ( | const CollationElementIterator & | other | ) | const |
Returns true if "other" is not the same as "this".
other | the object to be compared |
UBool icu::CollationElementIterator::operator== | ( | const CollationElementIterator & | other | ) | const |
Returns true if "other" is the same as "this".
other | the object to be compared |
int32_t icu::CollationElementIterator::previous | ( | UErrorCode & | status | ) |
Get the ordering priority of the previous collation element in the string.
status | the error code status. |
|
inlinestatic |
void icu::CollationElementIterator::reset | ( | void | ) |
Resets the cursor to the beginning of the string.
|
inlinestatic |
void icu::CollationElementIterator::setOffset | ( | int32_t | newOffset, |
UErrorCode & | status | ||
) |
Sets the offset of the currently processed character in the source string.
newOffset | the new offset. |
status | the error code status. |
void icu::CollationElementIterator::setText | ( | const UnicodeString & | str, |
UErrorCode & | status | ||
) |
Sets the source string.
str | the source string. |
status | the error code status. |
void icu::CollationElementIterator::setText | ( | CharacterIterator & | str, |
UErrorCode & | status | ||
) |
Sets the source string.
str | the source character iterator. |
status | the error code status. |
int32_t icu::CollationElementIterator::strengthOrder | ( | int32_t | order | ) | const |
Gets the comparison order in the desired strength.
Ignore the other differences.
order | The order value |
|
inlinestatic |
|
inline |
|
inline |