AI Model Building - A Simple Solution

This article provides both visual and written instructions for building AI models from CSV files


Introduction

An AI model is a software program that has been trained on a set of data to perform specific tasks such as recognizing certain patterns. Artificial intelligence models use decision-making algorithms to learn from the training and data and apply that learning to achieve specific pre-defined objectives.

Reveal AI supports model building in various ways. To help speed up the model building process, users can use a keyword list, or export a CSV file from previous cases to build the initial model. This document provides step-by-step instructions on how to create models using these two methods.

Process Flow

217 - 01 - AI Model Process Flow

Workflow Steps

Reveal supports importing models directly from a CSV file. The CSV file can be the one exported from a Brainspace portable model, or you can create one manually. If you are creating one manually, make sure the CSV file contains the following two columns:

  • term
  • weight

Note: The column names and terms need to be in lower case.

1.            Collect Terms and Weights

There are two ways to collect the materials that can be used to build an AI model.

  1. Use a self-generated keywords list. This could be a keyword list used by review team to search for relevant documents.
    Tip: Wildcard and proximity searching terms are not supported, so convert them to straight keywords before using them in the CSV file. 
  2. Export the features from an existing classifier in Reveal:
    1. Go to the Features tab in Classifier Details, click Download to download the CSV file:
    217 - 02 - Download Classifier Features

Note: The downloaded CSV will contain extra columns and will need to be updated before we can use it directly. See steps below for more details.

 

Raw Feature

Feature

Type

Model Weight

toni graham

toni graham

keyword

0.05834

mark broadfoot

mark broadfoot

keyword

0.03369

broadfoot

broadfoot

keyword

0.03256

mins

mins

keyword

0.03174

resume

resume

keyword

0.0301

sgrady

sgrady

document author

0.02756

cliff baxter

cliff baxter

keyword

0.02646

don

don

keyword

0.0262

phillip k.allen

phillip k.allen

keyword

0.02536

phillip k.

phillip k.

keyword

0.02524

2.            Update and fine-tuning

Once the CSV file is created, we can then add the term and weights as desired. There are two types of terms, (1) key word terms and (2) metadata features.

  • Keyword: supports multiple words, for example “natural gas” can be one term.
  • Metadata: lists the metadata field name and value in the following format:
    [“field”:”value”]

For example, the following metadata term is for metadata field “From”:

["SENDER","kaminski <kaminski>"]

To view a list of possible metadata fields, see the If you want to search this field...use this name in the RQL Query table in the link below; the 2nd column can be used as fields:

https://www.revealdata.com/knowledgebase/reveal/reveal-query-language-basics

We recommend the weights assigned to each term be in the range of -0.99 to 0.99, with at least one positive and one negative entry.

Here is an example:

term,weight

takata,0.5

airbag,0.9

recall,0.4

airbags,0.3

automaker,0.3

faulty,0.3

inflators,0.2

["SENDER","kaminski <kaminski>"]:0.8

window,-0.1

working,-0.1

yesterday,-0.4

3.            Import into Model Library

The CSV file can be imported into the model library using the IMPORT MODEL button below:

217 - 03 - Import Model button

Browse to the CSV file and import it to the model library.

Once it is imported, click the Settings 217 - 04 - Classifier Settings button button and update name/description:

217 - 05 - Model Settings

The saved model can then be applied to a classifier in the same way other library models are.

4.            Test and Validating

Follow the steps below to Test and Validate the new model:

  1. Create a new classifier in a new case.
  2. Go to the Edit Classifier
  3. Add the imported model to the classifier under AI Library Models.
    217 - 06 - Add Model in Classifier Settings
  4. Either Run Full Process using the button at the bottom of the AI Library Models section after selecting, or go to Classifier Details and click Run Full Process.
    217 - 07 - Run Full Process
  5. Confirm scores are populated after classifier finishes.

 

Last Updated 6/12/2024