An extensive track record across the industry! A high-precision, ultra-fast library for extracting text from document files.
The "Dehenken TF Library" is an embeddable text filter library that extracts text from document files such as MS-Office, PDF, and Ichitaro. It parses the internal binary data of each format to extract both property information and text. Its high speed significantly reduces index generation time for full-text search.

【Features】
■ High-precision, ultra-fast text extraction
■ Extensive track record across the industry
■ Diverse output options
■ A development environment that improves productivity

*For more details, please refer to the PDF materials or feel free to contact us.
Basic Information
【Other Features】
■ Multi-threading support
■ Support for Unicode and local character encodings
■ Safety-related configuration settings (doccat.conf)

*For more details, please refer to the PDF materials or feel free to contact us.
Applications and Case Examples
For more details, please refer to the PDF document or feel free to contact us.
Company Information
We are a software OEM company whose strength is text processing technology. We provide technical software to customers (companies) that develop software, helping make their products more competitive in the market. Our products include the text extraction library "Dehenken TF Library," the full-text search engine "Cyclope Engine," and the personal information file detection library "Dehenken Audit Library."