Break.Low
Low level interface.
This interface may be useful for table based implementers of segmenters. For each kind of break, property values are assigned integer values starting from 0
. An array allows to recover the high-level representation of the corresponding property value.
Warning. Do not mutate these array.
val line : Uchar.t -> int
line u
is an integer that can be used with line_of_int
.
line_max
is the maximal value returned by line
.
val line_of_int : line array
line_of_int.(i)
is the line break property value corresponding to i
.
val grapheme_cluster : Uchar.t -> int
grapheme_cluster u
is an integer that can be used with grapheme_cluster_of_int
.
grapheme_cluster_max
is the maximal value returned by grapheme_cluster
.
val grapheme_cluster_of_int : grapheme_cluster array
grapheme_cluster_of_int.(i)
is the grapheme cluster break property value corresponding to i
.
val word : Uchar.t -> int
word u
is an integer that can be used with word_of_int
.
word_max
is the maximal value returned by word
.
val word_of_int : word array
word_of_int.(i)
is the word break property value corresponding to i
.
val sentence : Uchar.t -> int
sentence u
is an integer that can be used with sentence_of_int
.
sentence_max
is the maximal value returned by sentence
.
val sentence_of_int : sentence array
sentence_of_int.(i)
is the sentence break property value corresponding to i
.
val indic_conjunct_break : Uchar.t -> int
indic_conjunct_break u
is an integer that can be used with indic_conjunct_break_of_int
.
indic_conjunct_break_max
is the maximal value returned by indic_conjunct_break_of_int
.
val indic_conjunct_break_of_int : indic_conjunct_break array
indic_conjunct_break.(i)
is the Indic conjunct break property value corresponding to i
.