|
ONTOLOGY-BASED HARVESTING AND RESEARCH ANALYSIS (OHARA)
SYSTEM OVERVIEW
Ontology-based Harvesting And Research Analysis (OHARA) System is ASALTA Technologies proprietary enterprise solution to efficiently acquire, organize, assess and share relevant information. OHARA is powered centrally by Knorex Lumina, an enterprise-level service-oriented knowledge discovery engine designed from ground-up to enable easy aggregation, mining, integration from disparate and heterogeneous (structured, e.g. relational database, spreadsheet), semi-structured (XML, emails, HTML) and unstructured (free-text articles)) data sources, querying and searching, all within a single platform. With the inference and reasoning capabilities built as part of the infrastructure, a domain specific ontology can be easily injected into the engine. Coupled with appropriate information processing, assessment and organization capabilities, OHARA becomes a powerful knowledge acquisition and discovery system. Figure 1: System Architecture Overview
KEY SYSTEM FEATURES (a)Information AcquisitionOHARA provides users with more than one method to acquire information into the system. Details as follow:
Semantic CrawlersBased on a domain specific ontology the Semantic Crawlers acquires information from the Internet and brings it back into OHARA automatically. The crawlers can search the Internet based on specific URLs or scheduled to crawl any organization databases. The crawlers have the capability of detecting and removing duplicate articles.
Scan WidgetMost users conducting any research would typically look out for more than one
type of information and possibly for more than one project. The scan widget allows users to manually upload these articles during their Internet research “on the fly”; without the need to launch OHARA. These uploaded articles will be stored temporarily in the system’s article sandbox and will only be transferred into the system’s database after users have confirmed and converted the article in the system (b) Information AssessmentOHARA provides users with easy to use information assessment capabilities combining both auto and manual methods to provide more details and resolution about an article to enhance the organization’s capability in knowledge discovery.
Article (Information) Tagging ModuleOHARA provides users with easy to use information assessment capabilities combining both auto and manual methods to provide more details and resolution about an article to enhance the organization’s capability in knowledge discovery. Article (Information) Tagging Module OHARA allows the users to tag (auto-tagging feature will not be part of the trial) the articles by title, description (or summary), author, source name (e.g. CNN), article URL and article category (primary tag- standard category of the article, i.e. according to how it is described in the ontology). The system also allows users to manually tag the articles with location, source type, free-form tags (secondary tags, i.e. flagging keywords of interest based on pertinent information from the article).
Article Assessment ModuleThis module will be customized according to how our client wants to rate or assess their articles. The ratings will be shown via the Dashboards provided in the system.
(c) Information OrganizationOHARA provides a single enterprise database for storage and retrieval of all articles, reports, tags, project setups and analysis.
Project Management ModuleThe module allows users to set up monitoring or research projects in OHARA. It provides all necessary functions to schedule the crawling frequency (i.e. Daily, Weekly or Monthly) of the crawlers. The module also enables users (administrators only) to add specific websites (i.e. URLs) for crawling via the project management module.
Monitor 360This module allows users to create, publish and track issues of concern (e.g.
Requirements analysis, Systems Safety etc). Users can manually create categories (EEIs and Indicators) and publish relevant articles to them. Based on the domain specific ontology, the auto-categorisation* feature will enable articles to be auto-categorized and placed into their respective topic bins.Users can then assess the importance of the article and publish the article to the information portal. *The auto-categorization feature in this module is available as a paid option only. Horizon 360This module allows users to publish issues that are emerging (i.e. yet to break out into the mainstream) and new. Users can create manual categories and then assign relevant articles into their respective article categories.
Clarity 360This module allows users to publish documents in all data formats, (e.g. pdf, doc, txt, ppt and xls). The purpose for this module is for users to publish and store internally generated reports. Users will be able to create categories to organization their documents.
(d) Information Retrieval and DiscoveryOHARA provides users with two capabilities to retrieve and discover information stored in the system database.
Enterprise SearchThe Enterprise Search feature allows the user to search the entire database based on Title, Content, Date, Source and Tags. Federated Search is available as a paid option.
Visual AnalyticsThe visual analytics provide users with information visualization capability. This capability enables all users with or without statistical capability to identify information patterns and discover new knowledge about the corpus of data in the database. In a normal system deployement, ASALTA will be providing a basic version of this feature. An advanced version of visual analytics will be available as a paid option.
(e) Information SharingOHARA provides users an enterprise wide information portal platform deployable via the Internet or intranet within an organization.
Information PortalThe information portal provides a one-stop information platform to view all published articles in a well-organized manner.
Report GeneratorThe report generator provides the user to collate relevant information and rapidly create reports for sharing. These reports can be generated in the form of HTML that can be easily copy and transferred into an email or a pdf document to be shared. The format of the report will be customized according to client’s needs. For the purpose of the trial, ASALTA will provide one customized report format only.
|
|





