C2RSS helps a user monitor information streams (e.g., RSS feeds) by identifying documents that are of potential interest to that user. The system automatically sorts information by topic and relevance, leveraging user feedback to refine suggestions.
Components
CALO Express
CALO Express (CE) is a lightweight personal desktop assistant that uses learning technology to identify relevant information on the workstation. CE finds and organizes information in email, on a workstation, and on shared drives, to quickly assemble new presentations and to prepare for meetings by gathering relevant information. CE provides a plug-in architecture to easily …
Classification Suite
Classification assigns individual items to discrete groups pre-specified by the user, based on features of the items. The classification module within the PAL Framework provides a unified API for three classification algorithms: Transformed Weight-Normalized Complement Naive Bayes, Maximum Entropy, and Decision Trees.
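As a rough illustration of feature-based classification into user-specified groups, here is a minimal sketch of plain Complement Naive Bayes over word counts. It is not the PAL API and it omits the term-frequency transforms and weight normalization of the "Transformed Weight-Normalized" variant named above; the function names `train_cnb` and `classify`, the labels, and the sample texts are all hypothetical.

```python
import math
from collections import Counter


def train_cnb(docs, alpha=1.0):
    """Train a plain Complement Naive Bayes model (illustrative only).

    docs: list of (label, text) pairs.
    Returns {label: {token: log-probability of the token in all OTHER labels}}.
    """
    vocab = {t for _, text in docs for t in text.lower().split()}
    labels = {label for label, _ in docs}
    weights = {}
    for c in labels:
        # Count tokens in every document that does NOT belong to class c.
        complement = Counter()
        for label, text in docs:
            if label != c:
                complement.update(text.lower().split())
        total = sum(complement.values()) + alpha * len(vocab)
        weights[c] = {
            t: math.log((complement[t] + alpha) / total) for t in vocab
        }
    return weights


def classify(weights, text):
    """Pick the class whose complement explains the text worst."""
    tokens = text.lower().split()

    def complement_score(c):
        # Tokens never seen in training contribute 0 for every class.
        return sum(weights[c].get(t, 0.0) for t in tokens)

    return min(sorted(weights), key=complement_score)


training = [
    ("work", "project deadline budget"),
    ("work", "budget meeting agenda"),
    ("personal", "dinner with family"),
    ("personal", "family vacation photos"),
]
model = train_cnb(training)
print(classify(model, "budget meeting"))  # -> work
print(classify(model, "family dinner"))   # -> personal
```

The complement formulation scores each class by how poorly the rest of the training data explains the item, which tends to be more stable than standard Naive Bayes when classes have very different amounts of training text.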
Clustering Suite
Clustering organizes objects into groups based on their similarity, as evaluated by comparing designated object attributes. The PAL Framework provides a unified clustering API for three state-of-the-art algorithms: Latent Dirichlet Allocation, Lingo, and Katz.
Contact Management (DEX)
Contact Management (DEX) is a server-based utility that, given a person’s email address and learned models of users’ home pages, crawls the Web to find contact information such as addresses, phone numbers, and expertise keywords.
Form Online Analyzer and Manager (FOAM)
Form Online Analyzer and Manager (FOAM) presents ranked suggestions for completing web-based forms based on cross-field models learned from previous form-filling episodes. This contextualized learning enables more accurate suggestions for forms with dependencies among fields. If you use FOAM, it is your responsibility to comply with all applicable privacy and data protection laws. Failure to …
iLink
iLink is a social networking component that builds a dynamic topic model for each user; the recommendation engine uses this model to suggest related people or information artifacts. iLink’s FAQtory allows a community to ask and answer questions and to build a FAQ repository. Social networks and online communities have become ubiquitous forums …
Instrumentation and Automation
Instrumentation and Automation records and executes a broad set of events in Thunderbird and Firefox, including composing and sending emails, attaching documents, and browsing URLs. Last modified Sep 29, 2010 10:41 AM. Instrumentation and automation play …
MALLET
MALLET provides a range of machine learning capabilities for statistical natural language processing, including document classification, clustering, and information extraction. MALLET was …
Meeting Assistant
Meeting Assistant (MA) is designed to help a user in meeting environments. It focuses on collecting speech in meetings, with basic natural language discourse and dialog understanding. Outputs from this component can be integrated with other delivered systems and research efforts to perform tasks such as: (1) indexing the meeting speech for …
MinorThird
MinorThird provides methods for storing, annotating, and categorizing text as well as learning to extract entities. MinorThird was developed by researchers at Carnegie Mellon University, primarily with DARPA funding. …
PAL Semantic Extraction
Semantic Extraction learns from training examples to recognize entities and semantic meta-structures, such as names, addresses, and structured phrases, in a body of text. PAL Semantic Extraction is a general-purpose, lightweight extraction engine that employs machine …
PAL SideBar UI
PAL Sidebar UI provides flexible, generic interface capabilities for interacting with learning components. The PAL Sidebar UI provides several built-in widgets for collecting user feedback, responses to suggestions, and status information that can provide essential data to drive learning components.
PrepPak
PrepPak finds relevant and related objects for a target object. A typical example is finding emails, documents, and contacts that are relevant to a given meeting. PrepPak supports feature extraction for the target object, algorithms to rank the returned objects by relevance, and a mechanism to refine those suggestions based on user feedback.
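The three stages described for PrepPak (feature extraction, relevance ranking, and feedback-based refinement) can be sketched as follows. This is a hypothetical illustration, not PrepPak's actual algorithms or interface: it assumes bag-of-words features, cosine similarity as the relevance score, and a simple additive adjustment for user feedback. The names `features`, `cosine`, `rank`, and `boost` are invented for the example.

```python
import math
from collections import Counter


def features(text):
    """Feature extraction: lowercase bag-of-words counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def rank(target, candidates, feedback=None, boost=0.5):
    """Rank candidate objects by relevance to the target.

    candidates: {name: text}
    feedback:   {name: +1 or -1} from earlier user responses (assumed scheme)
    """
    feedback = feedback or {}
    target_features = features(target)
    scored = []
    for name, text in candidates.items():
        score = cosine(target_features, features(text))
        score += boost * feedback.get(name, 0)  # refine with user feedback
        scored.append((score, name))
    # Highest score first; ties broken alphabetically by name.
    return [name for score, name in sorted(scored, key=lambda p: (-p[0], p[1]))]


meeting = "budget review meeting"
items = {
    "email_a": "budget review agenda",
    "doc_b": "lunch menu",
    "email_c": "quarterly budget numbers for review meeting",
}
print(rank(meeting, items))
# -> ['email_c', 'email_a', 'doc_b']
print(rank(meeting, items, feedback={"email_c": -1}))
# -> ['email_a', 'email_c', 'doc_b']
```

In the second call, a single negative response to `email_c` is enough to move it below `email_a`, which shows how feedback can reorder suggestions without retraining the underlying feature model.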
Probabilistic Consistency Engine (PCE)
Probabilistic Consistency Engine (PCE) combines possibly conflicting evidence from a collection of data sources into a most probable hypothesis consistent with the evidence. PCE takes as input a collection of facts and weighted rules and generates the marginal probabilities of individual atoms and formulas, using mechanisms based on Markov Logic Networks. A new development version of …
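To make the idea of deriving marginal probabilities from facts and weighted rules concrete, here is a toy sketch in the spirit of Markov Logic Networks. It is not PCE's input language or inference method: it assumes a handful of propositional atoms and computes marginals by enumerating every possible world, which only works for very small examples. The function `marginals` and the atom names are hypothetical.

```python
import itertools
import math


def marginals(atoms, weighted_rules, evidence):
    """Brute-force marginals for a tiny Markov-logic-style model.

    atoms:          list of atom names
    weighted_rules: list of (weight, rule), where rule(world) -> bool
    evidence:       {atom: bool} for atoms whose truth value is a known fact
    Each world is scored exp(sum of weights of the rules it satisfies).
    """
    free = [a for a in atoms if a not in evidence]
    totals = {a: 0.0 for a in atoms}
    z = 0.0  # normalizing constant over all worlds consistent with evidence
    for values in itertools.product([False, True], repeat=len(free)):
        world = dict(evidence)
        world.update(zip(free, values))
        score = math.exp(sum(w for w, rule in weighted_rules if rule(world)))
        z += score
        for a in atoms:
            if world[a]:
                totals[a] += score
    return {a: totals[a] / z for a in atoms}


atoms = ["emailed", "colleague"]
rules = [
    # One source suggests: if they emailed, they are probably a colleague.
    (2.0, lambda w: (not w["emailed"]) or w["colleague"]),
    # A weaker, conflicting source suggests they are not a colleague.
    (1.0, lambda w: not w["colleague"]),
]
result = marginals(atoms, rules, evidence={"emailed": True})
print(round(result["colleague"], 3))  # -> 0.731
```

The two rules disagree, and the stronger one wins without silencing the weaker: the marginal for `colleague` ends up above 0.5 but well short of certainty. Real systems replace the exhaustive enumeration with approximate inference such as sampling.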