|
|
|
@@ -881,7 +881,7 @@ curl --request PUT \ |
|
|
|
{ |
|
|
|
"name": "manual.txt", |
|
|
|
"chunk_method": "manual", |
|
|
|
"parser_config": {"chunk_token_count": 128} |
|
|
|
"parser_config": {"chunk_token_num": 128} |
|
|
|
}' |
|
|
|
|
|
|
|
``` |
|
|
|
@@ -910,7 +910,7 @@ curl --request PUT \ |
|
|
|
- `"parser_config"`: (*Body parameter*), `object` |
|
|
|
The configuration settings for the dataset parser. The attributes in this JSON object vary with the selected `"chunk_method"`: |
|
|
|
- If `"chunk_method"` is `"naive"`, the `"parser_config"` object contains the following attributes: |
|
|
|
- `"chunk_token_count"`: Defaults to `256`. |
|
|
|
- `"chunk_token_num"`: Defaults to `256`. |
|
|
|
- `"layout_recognize"`: Defaults to `true`. |
|
|
|
- `"html4excel"`: Indicates whether to convert Excel documents into HTML format. Defaults to `false`. |
|
|
|
- `"delimiter"`: Defaults to `"\n"`. |