1. Property implementation
Title: Implementation
Specific value: "MarkdownHeaderTextSplitter"
2. Property chunk_size
Title: Chunk Size
| |
|---|
| Type | integer |
| Required | No |
| Default | 4000 |
Description: Maximum size of chunks to return
3. Property chunk_overlap
Title: Chunk Overlap
| |
|---|
| Type | integer |
| Required | No |
| Default | 200 |
Description: Overlap in characters between chunks
4. Property keep_separator
Title: Keep Separator
| |
|---|
| Type | boolean |
| Required | No |
| Default | false |
Description: Whether to keep the separator in the chunks
5. Property strip_whitespace
Title: Strip Whitespace
| |
|---|
| Type | boolean |
| Required | No |
| Default | true |
Description: If True, strips whitespace from the start and end of every document
Title: Headers To Split On
| |
|---|
| Type | array of array |
| Required | Yes |
Description: Headers we want to track, e.g., #, ##, etc.
| Array restrictions |
|---|
| Min items | N/A |
| Max items | N/A |
| Items unicity | False |
| Additional items | False |
| Tuple validation | See below |
| Array restrictions |
|---|
| Min items | 2 |
| Max items | 2 |
| Items unicity | False |
| Additional items | False |
| Tuple validation | See below |
7. Property return_each_line
Title: Return Each Line
| |
|---|
| Type | boolean |
| Required | No |
| Default | false |
Description: Return each line w/ associated headers