/ Home » About us » News » Nuxeo Launches New Modules for Semantic Linking and Auto-Categorization

« Nuxeo Partners with KOM Networks to... All news Nuxeo Studio 2.0 Now Available »

12/07/2010 02:00 pm

Nuxeo Launches New Modules for Semantic Linking and Auto-Categorization

Nuxeo Enterprise Platform and Apache Stanbol Semantic Engine Integration Available via Nuxeo Marketplace

Boston, Paris – Dec. 7, 2010 – Nuxeo, the Open Source Enterprise Content Management (ECM) platform company, announced today the availability of two new modules in the Nuxeo Marketplace that use semantic technologies to automate linked data services within a Nuxeo EP-based repository. The two new modules both use semantic technology to extend the body of information about recognizable data entities in a content repository. They are available as optional extensions to Nuxeo Enterprise Platform (EP), or any of the products and frameworks based on Nuxeo EP, such as Nuxeo DM. Both modules are immediately available for download via the new Nuxeo Marketplace, the first ECM industry “app store.”

“Automated Document Categorization” package: allows any Nuxeo EP-based content application to automatically complete document metadata for a newly created document, based on the textual content of the electronic file. Metadata such as language, subject, geographic coverage includes elements from the Dublin Core metadata standard, and from the Nuxeo application. When a new document is added, the text is extracted then tokenized and each token counted to perform advanced statistical analysis to suggest the most likely categories for the metadata fields.

“Semantic Linking” package: provides a call to the open source Apache incubator project now known as “Apache Stanbol.” Nuxeo has been a very active contributor to this open source OSGi-based RESTful semantic engine project, established under the Interactive Knowledge Stack project (IKS) and formerly known as “FISE.” This semantic service analyzes document text to find notable people, places, or organizations using DBPedia, as an online reference knowledge base created from information extracted from Wikipedia. The semantic engine identifies notable entities within the file text. An entity hub then enables access to related information, such as lists of other repository documents that reference the same entity, and descriptions and images from DBPedia. Entities can also be manually created, or manually linked from a document. News agencies, educational institutions, research firms or any organization needing quick, accurate identification of known personalities, organizations, or places, across large volumes of text, will benefit from this packaged module.

Features and a demonstration can be viewed in this video.

“Nuxeo has been contributing heavily to this exciting initiative and is pleased to see it gain momentum with acceptance as “Stanbol” under the Apache incubator program,” notes Eric Barroca, Nuxeo CEO. “This project is unique and important in the evolution of content analytics, because it is available as open source, ensuring semantic linking capabilities can be embedded and used across a broad range of content-enabled applications either online or on-premise inside enterprises.”

Nuxeo Marketplace is a complete solution catalog offering a range of packaged plug-ins, templates and applications created by Nuxeo developers, Nuxeo Galaxy partners and customers. Nuxeo Connect subscribers can access Nuxeo Marketplace via their Nuxeo Connect portal sign-in. Non-subscribers can sign up for a 30-day free trial account that also includes access to Nuxeo Studio.

About Nuxeo 
Nuxeo delivers an open source document management application built with a complete, modular and extensible open source platform for enterprise content management. Other packaged applications built with the platform provide solutions for digital asset management and case management. Designed by developers for developers, the Nuxeo Enterprise Platform offers modern technologies, unmatched modularity, a powerful plug-in model and extensive packaging capabilities. Using a fully open source development model, Nuxeo provides a subscription program with software maintenance, technical support and customization tools. Nuxeo ECM is trusted by 1000+ organizations across 145 countries, including Cengage Learning, Pearson Education, AFP News Agency, EllisDon and Jeppesen, a Boeing Company. Nuxeo is dual-headquartered in North America (Boston) and Western Europe (Paris). More information is available at www.nuxeo.com.

Contact for Nuxeo:
Sarah Conway
Clement Communications
978-578-5300
pr@nuxeo.com

Export PDF