Module Break.Low

Low level interface.

This interface may be useful for implementers of table-based segmenters. For each kind of break, property values are assigned integer values starting from 0. For each kind of break, an array maps these integers back to the high-level representation of the corresponding property value.
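As an illustration of the table-based approach mentioned above, here is a minimal sketch of a pair table indexed by two property integers. The names make_pair_table and lookup are hypothetical helpers, not part of this module, and the rule function f is supplied by the caller. The sketch uses only the standard library.

```ocaml
(* Hypothetical helpers, not part of this module. *)

(* [make_pair_table ~max ~f] builds a flat (max+1) x (max+1) table.
   The entry for (a, b) is set when [f a b] is true. [max] is meant
   to be one of the *_max values of this module, e.g. word_max. *)
let make_pair_table ~max ~(f : int -> int -> bool) : Bytes.t =
  let n = max + 1 in
  let t = Bytes.make (n * n) '\000' in
  for a = 0 to max do
    for b = 0 to max do
      if f a b then Bytes.set t (a * n + b) '\001'
    done
  done;
  t

(* [lookup ~max t a b] reads the entry for the pair (a, b). *)
let lookup ~max (t : Bytes.t) a b =
  Bytes.get t (a * (max + 1) + b) = '\001'
```

With this module, such a table would presumably be sized with word_max (or another *_max value) and queried with the integers returned by word. The rule passed as f is left to the segmenter implementer.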

Low level access to break properties

Warning. Do not mutate these arrays.

val line : Uchar.t -> int

line u is an integer that can be used with line_of_int.

val line_max : int

line_max is the maximal value returned by line.

val line_of_int : line array

line_of_int.(i) is the line break property value corresponding to i.
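The integer encoding makes it easy to index plain arrays by property value. The following minimal sketch counts how many scalar values fall in each class. The name histogram is a hypothetical helper, not part of this module. Its classify argument stands for a function such as line, and max for the matching line_max.

```ocaml
(* Hypothetical helper, not part of this module. *)

(* [histogram ~max ~classify us] returns an array of length [max + 1]
   whose cell [i] holds the number of elements [u] of [us] such that
   [classify u = i]. Assumes [classify] only returns values in 0..max. *)
let histogram ~max ~(classify : Uchar.t -> int) (us : Uchar.t list) =
  let counts = Array.make (max + 1) 0 in
  List.iter
    (fun u ->
      let i = classify u in
      counts.(i) <- counts.(i) + 1)
    us;
  counts
```

Applied with line and line_max, each index i of the result could then be mapped back to a high-level value with line_of_int.(i).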

val grapheme_cluster : Uchar.t -> int

grapheme_cluster u is an integer that can be used with grapheme_cluster_of_int.

val grapheme_cluster_max : int

grapheme_cluster_max is the maximal value returned by grapheme_cluster.

val grapheme_cluster_of_int : grapheme_cluster array

grapheme_cluster_of_int.(i) is the grapheme cluster break property value corresponding to i.

val word : Uchar.t -> int

word u is an integer that can be used with word_of_int.

val word_max : int

word_max is the maximal value returned by word.

val word_of_int : word array

word_of_int.(i) is the word break property value corresponding to i.

val sentence : Uchar.t -> int

sentence u is an integer that can be used with sentence_of_int.

val sentence_max : int

sentence_max is the maximal value returned by sentence.

val sentence_of_int : sentence array

sentence_of_int.(i) is the sentence break property value corresponding to i.

val indic_conjunct_break : Uchar.t -> int

indic_conjunct_break u is an integer that can be used with indic_conjunct_break_of_int.

val indic_conjunct_break_max : int

indic_conjunct_break_max is the maximal value returned by indic_conjunct_break.

val indic_conjunct_break_of_int : indic_conjunct_break array

indic_conjunct_break_of_int.(i) is the Indic conjunct break property value corresponding to i.