docs/algo/data_format_en.md
A text format where each line represents a labeled feature vector using the following format:
label index1:value1 index2:value1 index3:value3 ...
# libsvm example
1 1:0.5 3:3.1 7:1.0
0 2:0.1 3:2.3 5:2.0
1 4:0.2 7:1.1 9:0.0
....
A text format where each line represents a labeled feature vector, separated by (space) using the following format:
label index1 index2 index3
# dummy type example
0 3 7 999 666
1 0 2 88 77
...
Note: if the row splitor is not (space), you can specify a splitor (say ",") by using the following option:
ml.data.splitor=,
For multi-class problem, such as softmax regression, angel require the labels start from 0, while for binary classification problom, the labels are required +1 and -1. As a result, a transform may be reqired.