Harmonious Center of Competency

Hitachi

In the current business environment, there is an urgent need to analyze log data, such as operation logs and search logs, in order to strengthen security against information leaks. When conventional log analysis methods are used to retrieve information from vast volumes of log data, there is always a risk of overlooking information, because terms that are not registered in a dictionary cannot be searched. Hitachi's full-text search solution adopts Hitachi's original search algorithm, an improved version of the n-gram(*1) method. This flexible log search method keeps oversights to a minimum and can handle the notation variations that are specific to the Japanese language.

*1
n-gram (N-gram character index method): This method splits the input text into strings of n characters and creates an index that records, for each string, the number of the text that contains it and the position where it appears. Searches use this index to perform a high-speed full-text search without overlooking data.
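
The index described in this note can be illustrated with a minimal sketch. It is a generic n-gram positional index, not Hitachi's improved algorithm, and the function names `build_index` and `search`, the choice of n = 2, and the sample log lines are all assumptions made for illustration.

```python
from collections import defaultdict


def build_index(texts, n=2):
    """Map each n-character string to the (text number, position) pairs where it appears."""
    index = defaultdict(list)
    for text_no, text in enumerate(texts):
        for pos in range(len(text) - n + 1):
            index[text[pos:pos + n]].append((text_no, pos))
    return index


def search(index, query, n=2):
    """Return sorted (text number, position) pairs at which the query occurs."""
    if len(query) < n:
        raise ValueError("query must be at least n characters long")
    # Candidate start positions come from the first n-gram of the query.
    candidates = set(index.get(query[:n], ()))
    # Every later n-gram of the query must appear at the matching offset.
    for offset in range(1, len(query) - n + 1):
        gram_hits = set(index.get(query[offset:offset + n], ()))
        candidates = {(t, p) for (t, p) in candidates if (t, p + offset) in gram_hits}
    return sorted(candidates)


# Hypothetical log lines, invented for this example.
logs = [
    "user01 copied report.xlsx to USB",
    "user02 searched for payroll data",
    "user01 printed payroll summary",
]
index = build_index(logs)
hits = search(index, "payroll")  # list of (text number, position) pairs
```

Because the index covers every n-character string in the text, a term can be found whether or not it appears in any dictionary, which is the property the note refers to as searching without overlooking data.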

"Hitachi Full-text Search Solution" performs a flexible search operation without overlooking information

Hitachi Full-text Search Solution integrates HiRDB, Hitachi's relational database software, with a full-text search engine based on the n-gram method. Combining text data and attribute information within a single database is what makes the search operation flexible.
This demonstration introduces the efficient search operation as well as the advantages of Hitachi Full-text Search Solution.
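
The idea of combining attribute information with text data in one query can be sketched as follows. This example uses SQLite from the Python standard library with a plain `LIKE` substring match as a stand-in; it does not use HiRDB or an n-gram engine, and the table name, column names, and sample rows are hypothetical.

```python
import sqlite3

# In-memory database standing in for the log store (assumption: not HiRDB).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE logs ("
    "id INTEGER PRIMARY KEY, user_id TEXT, logged_on TEXT, message TEXT)"
)
conn.executemany(
    "INSERT INTO logs (user_id, logged_on, message) VALUES (?, ?, ?)",
    [
        ("user01", "2024-04-01", "user01 copied report.xlsx to USB"),
        ("user02", "2024-04-01", "user02 searched for payroll data"),
        ("user01", "2024-04-02", "user01 printed payroll summary"),
    ],
)


def search_logs(conn, user_id, term):
    """Filter by an attribute (user_id) and by a substring of the log text."""
    cur = conn.execute(
        "SELECT id, message FROM logs "
        "WHERE user_id = ? AND message LIKE ? ORDER BY id",
        (user_id, f"%{term}%"),  # LIKE wildcards in term are not escaped here
    )
    return cur.fetchall()


matches = search_logs(conn, "user01", "payroll")
```

The attribute condition narrows the rows and the text condition narrows them further in the same statement, so no separate pass over an external search result is needed.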

Contents of demonstration
  1. Description of Hitachi Full-text Search Solution
    • Log analysis using a conventional method
    • Log analysis using the n-gram method
  2. Actual machine-based demonstration
    • Demonstration of full-text search of log using the n-gram method
This demonstration consists of a description of the full-text search operation using a presentation tool (PPT) and an actual machine-based demonstration.
Required time: Approximately 30 minutes