ONTOLOGY-BASED HARVESTING AND RESEARCH ANALYSIS (OHARA)

SYSTEM OVERVIEW

Ontology-based Harvesting And Research Analysis (OHARA) System is ASALTA Technologies  proprietary enterprise solution to efficiently acquire, organize, assess and share relevant information.

OHARA is powered centrally by Knorex Lumina, an enterprise-level service-oriented knowledge discovery engine designed from ground-up to enable easy aggregation, mining, integration from disparate and heterogeneous (structured, e.g. relational database, spreadsheet), semi-structured (XML, emails, HTML) and unstructured (free-text articles)) data sources, querying and searching, all within a single platform.

With the inference and reasoning capabilities built as part of the infrastructure, a domain specific ontology can be easily injected into the engine. Coupled with appropriate information processing, assessment and organization capabilities, OHARA becomes a powerful knowledge acquisition and discovery system.

 Ontology Based Harvesting And Research Analysis

Figure 1: System Architecture Overview

KEY SYSTEM FEATURES

(a)Information Acquisition
OHARA provides users with more than one method to acquire information into the system. Details as follow:
Semantic Crawlers
Based on a domain specific ontology the Semantic Crawlers acquires information from the Internet and brings it back into OHARA automatically. The crawlers can search the Internet based on specific URLs or scheduled to crawl any organization databases. The crawlers have the capability of detecting and removing duplicate articles.
Scan Widget
Most users conducting any research would typically look out for more than one

type of information and possibly for more than one project. The scan widget allows users to manually upload these articles during their Internet research “on the fly”; without the need to launch OHARA. These uploaded articles will be stored temporarily in the system’s article sandbox and will only be transferred into the system’s database after users have confirmed and converted the article in the system

(b) Information Assessment
OHARA provides users with easy to use information assessment capabilities combining both auto and manual methods to provide more details and resolution about an article to enhance the organization’s capability in knowledge discovery.
Article (Information) Tagging Module
OHARA provides users with easy to use information assessment capabilities combining both auto and manual methods to provide more details and resolution about an article to enhance the organization’s capability in knowledge discovery. Article (Information) Tagging Module OHARA allows the users to tag (auto-tagging feature will not be part of the trial) the articles by title, description (or summary), author, source name (e.g. CNN), article URL and article category (primary tag- standard category of the article, i.e. according to how it is described in the ontology). The system also allows users to manually tag the articles with location, source type, free-form tags (secondary tags, i.e. flagging keywords of interest based on pertinent information from the article).
Article Assessment Module
This module will be customized according to how our client wants to rate or assess their articles. The ratings will be shown via the Dashboards provided in the system.
(c) Information Organization
OHARA provides a single enterprise database for storage and retrieval of all articles, reports, tags, project setups and analysis.
Project Management Module
The module allows users to set up monitoring or research projects in OHARA. It provides all necessary functions to schedule the crawling frequency (i.e. Daily, Weekly or Monthly) of the crawlers. The module also enables users (administrators only) to add specific websites (i.e. URLs) for crawling via the project management module.
Monitor 360
This module allows users to create, publish and track issues of concern (e.g.

Requirements analysis, Systems Safety etc). Users can manually create categories (EEIs and Indicators) and publish relevant articles to them. Based on the domain specific ontology, the auto-categorisation* feature will enable articles to be auto-categorized and placed into their respective topic bins.Users can then assess the importance of the article and publish the article to the information portal. *The auto-categorization feature in this module is available as a paid option only.

Horizon 360
This module allows users to publish issues that are emerging (i.e. yet to break out into the mainstream) and new. Users can create manual categories and then assign relevant articles into their respective article categories.
Clarity 360
This module allows users to publish documents in all data formats, (e.g. pdf, doc, txt, ppt and xls). The purpose for this module is for users to publish and store internally generated reports. Users will be able to create categories to organization their documents.
(d) Information Retrieval and Discovery
OHARA provides users with two capabilities to retrieve and discover information stored in the system database.
Enterprise Search
The Enterprise Search feature allows the user to search the entire database based on Title, Content, Date, Source and Tags. Federated Search is available as a paid option.
Visual Analytics
The visual analytics provide users with information visualization capability. This capability enables all users with or without statistical capability to identify information patterns and discover new knowledge about the corpus of data in the database. In a normal system deployement, ASALTA will be providing a basic version of this feature. An advanced version of visual analytics will be available as a paid option.
(e) Information Sharing
OHARA provides users an enterprise wide information portal platform deployable via the Internet or intranet within an organization.
Information Portal
The information portal provides a one-stop information platform to view all published articles in a well-organized manner.
Report Generator
The report generator provides the user to collate relevant information and rapidly create reports for sharing. These reports can be generated in the form of HTML that can be easily copy and transferred into an email or a pdf document to be shared. The format of the report will be customized according to client’s needs. For the purpose of the trial, ASALTA will provide one customized report format only.
Share |