Document Processing Bus: CITE.DocuProc
The CITE.DocuProc undertakes the extraction of attributes, metadata and text from the documents delivered to it.
Specifically, it receives one or more documents that are part of a message along with the metadata that accompanies it until the specific phase of its processing by CITE.BPMS and after processing it through a series of filters, returns any possible digital files and metadata for the message and documents it received. Thus, through CITE.DocuProc can exemplarily export alternative formats of a file (eg MS Office to PDF), convert an image to alternative compression formats (e.g. jpg to png, tiff to jpeg etc.), extract the text of a file ( e.g. full text from an MS Office document or PDF), visual recognition of characters from a document scan file (supported in 10,000 languages), it can convert a video file (e.g. from any format to mp4 suitable for web traffic), extract structured metadata (e.g. location and time information from photos) etc.