Home » Document Processing Bus: CITE.DocuProc

Document Processing Bus: CITE.DocuProc

back to CITE.BPMS

The CITE.DocuProc undertakes the extraction of attributes, metadata and text from the documents delivered to it.

Specifically, it receives one or more documents that are part of a message along with the metadata that accompanies it until the specific phase of its processing by CITE.BPMS and after processing it through a series of filters, returns any possible digital files and metadata for the message and documents it received. Thus, through CITE.DocuProc can exemplarily export alternative formats of a file (eg MS Office to PDF), convert an image to alternative compression formats (e.g. jpg to png, tiff to jpeg etc.), extract the text of a file ( e.g. full text from an MS Office document or PDF), visual recognition of characters from a document scan file (supported in 10,000 languages), it can convert a video file (e.g. from any format to mp4 suitable for web traffic), extract structured metadata (e.g. location and time information from photos) etc.