Based on Paper,the choice of base,method of mapping chinese character strings into integers and collision problem are researched,and an improved chinese character strings sorting technique is given.
(2) the probing strategy for extraction of text strings of arbitrary direction and length, which can extract Chinese/Western character strings integrally, and enhances its performance in processing speed, space complexity by introducing the concepts of accurate HOUGH space requirements, collinear relaxation and the string based HOUGH field refreshing method. The performance evaluation of SegChar is given finally.
Precise locating of character string is realized by exploiting horizontal projection and character arragements of binary images in horizontal and vertical directions respectively.
The hashing function is defined in terms of a trie, which is basically a binary tree where a character string is associated implicitly with each node.
Based on this cut-off criterion, a fast algorithm has been developed to find the nearest neighbors of a given character string in a dictionary.
It is demonstrated here that by introduction of three new operators, namely,position, fraction, andconcatenation, expressions of typical programming languages can be executed with a unified method starting from the character string.
Using a semiring interpretation, the weight of a morphological analysis result represents the possible linguistic interpretations of an input word, while the resulting character string itself represents the lemma of the input.