Segmenting XLIFF 2.0 Files by Paragraph
- Laura White (Unlicensed)
- Kenny Teves
- Aaron Sakievich (Unlicensed)
For highly technical or creative content (where a degree of transcreation is needed), it may be desirable to segment on a paragraph (rather than a sentence). We’ve added the ability to turn segmentation on or off for XLIFF 2.0 files via the FPRM filter. Additionally, we’ve added the option to turn segmentation on/off for a specific unit ID.
To segment a document by something other than the XLIFFÂ 2.0 defaults,
- Enable paragraph segmentation within the FPRM filter.
- Enable paragraph segmentation within the document. This can be controlled via each unit within the document.
Set the FPRM Filter
Adjust Segmentation (via the FPRM Filter Config)
By default, Enterprise's XLIFF 2.0 filter configuration segments documents into sentences. To adjust segmentation, use the following variables (mergeAsParagraph, needsSegmentation) in the FPRM filter config file (defined and outlined below).
mergeAsParagraph.b
In the filter config, use mergeAsParagraph.b to specify how segments will be treated when the file is downloaded again.
- True: The file will be merged back with its original segmentation.
- False: The file will be merged back with the new segmentation specified by Enterprise.
needsSegmentation.b
In the filter config, use needsSegmentation.b to determine whether segmentation can be adjusted on the XLIFF 2.0 file. (XLIFF by nature already has the segments/text units defined. The default behavior in the XLIFF 2 filter is to NOT resegment).
- True: Further segmentation can be enabled on the XLIFF file.
- False:Â Further segmentation is not enabled on the XLIFF file.
A bilingual XLIFF CANNOT have the filter config set to needsSegmentation.b="true" (this will cause it to error out).
Choose to segment by paragraph, sentence, or phrase.
Use the instructions below to adjust the FPRM filter config.
 Tip: When uploading a document needing custom segmentation, apply the newly created FPRM filter config.
- To segment on paragraphs, add:
- mergeAsParagraph.b=true
- needsSegmentation.b=trueÂ
Â
- To segment on sentences, leave the FPRM filter's default settings.
 - To segment on phrases (i.e. something shorter than a sentence), add:
- mergeAsParagraph.b=false
- needsSegmentation=true
Set the Document
After applying the appropriate FPRM filter, go into the XLIFF 2.0 document and adjust its segmentation. By default, the file will segment on sentences. If there is a paragraph that should not be segmented (i.e. its contents should be a single segment), you can set the canResegment variable.
canResegment
To combine all sentences within a unit into a single segment, go into the unit and set canResegment=no. This will bundle the entire paragraph into a single segment.