Splits text from the 'text' column of a data frame into individual paragraphs, based on a specified paragraph delimiter.
nlp_split_paragraphs(tif, paragraph_delim = "\\n+")
A data.table with columns: `doc_id`, `paragraph_id`, and `text`. Each row represents a paragraph, along with its associated document and paragraph identifiers.
tif <- data.frame(doc_id = c('1', '2'),
text = c("Hello world.\n\nMind your business!",
"This is an example.n\nThis is a party!"))
paragraphs <- nlp_split_paragraphs(tif)