AccessEngine :: AEDevice :: AEOutput :: Word :: Word :: Class Word

Class Word

object --+
         |
        Word

Represents a word in a body of text. Each Word has a main and a trailing part where the main part is processed according to other flags in the current WordState to improve its presentation to the user via a speech or other output device while the trailing part remains unprocessed. The value of WordDef determines what characters lie in the main and trailing parts of each word. The following constants are available in AEConstants.

WORD_NON_BLANK: All non-blank characters are added to the main part
WORD_ALPHABETIC: All characters considered letters in the current locale are added to the main part
WORD_ALPHA_NUMERIC: All characters considered letters and digits in the current locale are added to the main part
WORD_ALPHA_PUNCT: All characters considered letters and punctuation in the current locale are added to the main part
WORD_ALPHA_NUMERIC_PUNCT: All characters considered letters, digits, and punctuation in the current locale are added to the main part

Characters in the ignore list are considered blank. A AEPor can be associated with a Word to indicate its context in a larger body of text.

Callables may be specified as observers for characters processed by the main and trail parts of each Word. An observer must take four parameters, this Word instance, the WordState in use, the current character, and the list of all characters in the main or trail part of the word. The observer should return the character to be added. The list may be modified in place to affect the final contents of the word.

Instance Methods

[hide private]

__init__(self, state, por, main_ob=None, trail_ob=None)
Stores the WordState and initializes all instance variables.

source code

__eq__(self, other)
Compares this Word to the one provided based on their AEPors and content.

source code

string

__unicode__(self)
Gets this Word as a unicode string.

source code

string

__str__(self)
Gets this Word as a non-unicode string.

source code

_isMainChar(self, ch)
Determines if the given character should be considered a part of the main part of this word or not based on the definition of the word given by WordState.

source code

replaceMain(self, text)
Replaces the main part of the word with the given string.

source code

replaceTrail(self, text)
Replaces the main part of the word with the given string.

source code

AEPor

getPOR(self)
Gets the AEPor associated with the start of this Word.

source code

boolean

isBlank(self, ch)
Determines if the given character is blank or ignored.

source code

boolean

isAlpha(self, ch)
Determines if the given character is a letter in the current locale.

source code

boolean

isNumeric(self, ch)
Determines if the given character is a number in the current locale.

source code

boolean

isPunctuation(self, ch)
Determines if the given character is a punctuation mark.

source code

boolean

isSymbol(self, ch)
Determines if the given character is a symbol.

source code

boolean

isVowel(self, ch)
Determines if the given character is a vowel.

source code

boolean

isCap(self, ch)
Determines if the given character is an upper case letter.

source code

string

getCharValue(self, ch)
Gets the unicode hex value for a character sans the 0x prefix.

source code

string

getCharName(self, ch)
Gets the unicode name of the character, one of the strings listed in the http://unicode.org/charts/charindex.html.

source code

boolean

getCharDescription(self, ch)
Gets a localized description of the given character.

source code

string

getSource(self)
Gets the unprocessed text of this word as it was seen in the original text source.

source code

integer

getSourceLength(self)
Gets the length of the unprocessed source text of this Word.

source code

integer

getMainLength(self)
Gets the length of the processed main part of this Word.

source code

boolean

moreAvailable(self)
Makes a guess as to whether or not there are more Words in the body of text from which this word originated.

source code

boolean

hasRepeat(self)
Gets if this Word has a character repeated more than the maximum number of repetitions allowed or not.

source code

boolean

hasCap(self)
Gets if this Word contains an uppercase letter or not.

source code

boolean

hasVowel(self)
Gets if this Word contains a vowel or not.

source code

boolean

isAllCaps(self)
Gets if this Word is all capitals or not.

source code

boolean

isAllNumeric(self)
Gets if this Word is all numbers or not.

source code

boolean

isAllBlank(self)
Gets if this Word is all blanks or not.

source code

string or None

append(self, chunk)
Parses the given chunk of text for characters that should be added to the main_part or trail_part of this Word.

source code

string

_processMain(self, ch)
Adds the given character to the source_word.

source code

string

_processTrail(self, ch)
Adds the given character to the source_word.

source code

Inherited from object: __delattr__, __getattribute__, __hash__, __new__, __reduce__, __reduce_ex__, __repr__, __setattr__

Instance Variables

[hide private]

integer

curr_repeat
Indicates a character should be considered a repeat iff this value > MaxRepeat.

boolean

has_main
Has at least one main character been parsed?

string

last_char
Last character appended to this Word

boolean

main_done
Is the main_part complete?

callable

main_ob
Function to invoke for each character in the main part of a word

list

main_part
Part of this Word that will receive extra preparation for output

boolean

more
Are there likely more Words after this one in the text source where this Word originated?

AEPor

por
Point of regard indicating where this Word originated

list

source_word
Original text of this Word without any preparation for output applied

WordState

state
Settings that determine the definition of a Word and how it is prepared for output

boolean

trail_done
Is the trail_part complete?

callable

trail_ob
Function to invoke for each character in the trailing part of a word

list

trail_part
Part of the word that will receive little preparation for output

Properties

[hide private]

Inherited from object: __class__

Method Details

Class Word

__init__(self, state, por, main_ob=None, trail_ob=None) (Constructor)

__eq__(self, other) (Equality operator)

__unicode__(self)

__str__(self) (Informal representation operator)

_isMainChar(self, ch)

replaceMain(self, text)

replaceTrail(self, text)

getPOR(self)

isBlank(self, ch)

isAlpha(self, ch)

isNumeric(self, ch)

isPunctuation(self, ch)

isSymbol(self, ch)

isVowel(self, ch)

isCap(self, ch)

getCharValue(self, ch)

getCharName(self, ch)

getCharDescription(self, ch)

getSource(self)

getSourceLength(self)

getMainLength(self)

moreAvailable(self)

hasRepeat(self)

hasCap(self)

hasVowel(self)

isAllCaps(self)

isAllNumeric(self)

isAllBlank(self)

append(self, chunk)

_processMain(self, ch)

_processTrail(self, ch)

curr_repeat

init(self, state, por, main_ob=None, trail_ob=None)
(Constructor)

eq(self, other)
(Equality operator)

unicode(self)

str(self)
(Informal representation operator)