Table of Contents

1 Requirements

See Requirements.

2 Getting Starting

Firstly make sure that you have all the necessary requirements. If you are on working on Linux either install the RPM package of ASR-Builder or extract the source from the source archive. Run

python setup.py install

to install the package from the source. Windows users need to run the installer, ASR-Builder-0.1.win32.exe, which will take you through a step-by-step process to install the application.

To run ASR-Builder, in Linux, type:

asr-builder

on the command line. If no CLI arguments are provided, ASR-Builder will start in in its GUI mode. To run ASR-Builder on the command-line, one needs to use one of the CLI command options. ASR-Builder has 9 command options: -P/--project, -A/--acousticmodel, -L/-languagemodel, -R/--runscript, -V/--removemodel, -M/--modify, -I/--ViewInfo, -C--configure and -h/--Help. ASR-Builder has more variable options that need to be used in combination with the command options. Please refer to the CLI help for more information on these options (asr-builder -h/--help).

For Windows users, the script is located under '<Python install path>\Scripts'. Use Python to run the script ('asr-builder'). All the command-line arguments given above will work as well.


(back to the top)

3 ASR-Builder Graphical User Interface

Before one can create, train or test acoustic and langauge models or do anything else for that matter, a project needs to be created. ASR-Builder can only have one project open at a time. But each project can have multiple acoustic and language models.

The main window (shown in Figure 1) has two tree views. The 'Project models' view, on the left-hand side, contains the acoustic and language models in a project. The 'Selected model contents', on the right-hand side, shows the files and folders within the model selected (highlighted) in the 'Project models' view. The 'Remove' button allows the user to remove the selected model. The 'Refresh' button refreshes the list of files and folders that a model contains. The 'View' button opens the selected file in a basic editor.

ASR-Builder's main window

Figure 1: ASR-Builder main window, with a project already open.

3.1 Projects

3.1.1 Creating

To create a project, select the 'New' option from the 'Project' menu, or press 'Ctrl + N'. This will open the dialog (see Figure 2) which allows a user to create a new project. The dialog requires a name for the project and a directory location, where the project will be saved.

New project window

Figure 2: The new project window.

3.1.2 Opening

To open a project, select the 'Open' option from the 'Project' menu, or press 'Ctrl + O'. This will open the dialog (see Figure 3) which allows a user to open an existing project. The dialog requires the name of the project file (.proj) for the existing project that should be opened.

Open project window

Figure 3: The open project window.

3.1.3 Saving/Closing

There is no need to save a project as the system autmatically saves all changes. To close a project, select the 'Close' option from the 'Project' menu, or press 'Crtl + C'. If a you close a project, you will be prompted to save the project before it is closed.

3.1.4 Removing

To remove/delete a project, one needs only to delete the directory where the project is saved.


(back to the top)

3.2 Acoustic and Language Models

3.2.1 Creating an Acoustic Model

To create an acoustic model, make sure that project is open, then select the 'Acoustic' option from the 'Models' menu, or press 'Ctrl + A', this will bring up the dialog (see Figure 4). There are several fields that need to be filled in. The first field is the name of the model, note that no two models can have the same name. The next field is the dictionaries field, you can add one or more dictionaries for the model, or even a file that contains a list of the dictionaries to use. A file containing a list of dictionaries needs to have a '.list' extension or else it will be treated as a dictionary. For the audio data, either the directories containing the training and test data must be added to the respective fields or a file containing a list of audio files to use. If you check the 'MFCCs' box, the system will treat the audio files as being already converted into MFCC format. Specifying an extension is optional but if no extension is specified then all the files in the specified directories are assumed to be audio files. For the transcriptions, either the directories containing the training and test transcriptions must be added to the respective fields or a file containing a list of transcription files to use, as was the case with the audio data. Specifying an extension is optional but if no extension is specified then all the files in the specified directories are assumed to be transcriptions. Specifying extensions is useful if a directory contains different types of files and data. If you have a file containing questions (for tying bi-phones and tri-phones), it can be specified in the relevent field. The questions file is optional and can be added later to an acoustic model. The type of acoustic model is determined by the script you run to train the model.


Important: Altough specifying extensions for data is optional the data is expected to be in certain formats. Audio data is expected to be RIFF Wave files, if you want to use a different format, then you should change the parameters for HCopy. Transcriptions are expected to be plain text although ASR-Builder can remove punctuation and change the case, if specified.

Acoustic model window

Figure 4: The new acoustic model window.

3.2.2 Creating a Language Model

To create an language model, make sure that a project is open, then select the 'Language' option from the 'Models' menu, or press 'Ctrl + L', this will bring up the dialogue. There only two fields that need to be filled in. The first field is the name of the model, note that no two models can have the same name. The second field is the source of data for the language model. This can be a file or a directory containing multiple files. What data you provide depends on the type of language model you wish to create. The type of language model is determined by the script you run to train the model.

Language model window

Figure 5: The new language model window.

3.2.3 Running a Script

Once an acoustic or a language model has been created, you can run various scripts to train, test, recognise or do pretty much anything else that you require. ASR-Builder comes with a few basic scripts that you can use. You are also free to create your own scripts. To run a script select the 'Run script' option from the 'Tools' menu, or press 'Ctrl + R', this will open a dialogue (see Figure 6). You can choose a script from one of two locations. The first, and default, is an internal ASR-Builder script. These scripts come with ASR-Builder and provide basic training, testing and recognition operations. The second option is a custom script, these scripts are created by you and located in any specific directory and do not have any specific extension. Once a script is selected and you press the 'Run' button, a window is opened displaying the contents of the selected script as well as the HTK configuration parameters for the currently selected model. For more detailed information on creating scripts, see Section 7.

Run script window

Figure 6: The run script window.

3.2.4 Modifying a Model

To modify a model, select the model in the 'Project models' tree, on the left side of the main window, and select the 'Modify' option from the 'Models' menu, or press 'Ctrl + M'. This will display a dialogue (see Figure 7), where you can select the type of data, the data, the tag for the data and where the data should be stored. The type of data can be audio data, transcriptions, a file or a folder. The reason for having audio data and transcriptions is that ASR-Builder will do the same processing of these data sources as when a new acoustic model is created. The 'Tag' field allows you to enter the tag that you wish the new data is associated with in a model. The source/location of the data is entered in the 'Data' field. An optional field, for 'File' or 'Folder' data types only, is the 'Copy location' field. This allows you to specify a specific location within the model.

Modify window

Figure 7: The modify window.

3.2.5 Configuring a Model

To configure a model, select the model in the 'Project models' tree, on the left side of the main window, and select the 'Configure' menu from the 'Models' menu. This will give you a menu with three choices. You can load or save a configuration or view the current configuration and make changes to it. The 'Load Config' ('Ctrl+F') option allows you load a file containing multiple configuration parameters for multiple HTK tools. The format of the file is, one configuration parameter per line, in the form 'TOOL:PARAMETER:VALUE:DELETE'. The fields are as follows:

To save the configuration to a file, select 'Save Config' or press 'Ctrl + G'. To view the entire configuration, select 'View Config' or press 'Ctrl+W'. Each tab in the window is for a separate HTK tool. The configuration can also be changed in the view configuration dialog.

Configuration window

Figure 8: The configuration window.

3.2.6 Viewing Model Keys

To view the keys for a model and their associated values, select the model in the 'Project models' tree, on the left side of the main window, and select the 'View Keys' option from the 'Models' menu, or press 'Ctrl + K'.

Key view window

Figure 9: The model key viewing window.

3.2.7 Preprocessor Options

The preprocessor can be used to substitute text in transcriptions and check the transcriptions for missing words. The substitution mechanism uses regular expresion to search for text to replace. Each regular expression and replacement text pair is specified in the appropriatiate text box (see Figure 10) and then added to the list. If a regular expression exists in the list, it is overwritten. NB: Single quotes (') cannot be used. There is an option to perform a word check on the transcriptions. A file is created, that lists the transcriptions that have missing words and the words that are missing. The file is located in the model's folder and is called 'missing_words.txt'. Also it creates two sets of lists of transcriptions (for training and testing), one that can be used immediately to train/test a model and one with the transcriptions that contain the missing words.

Key view window

Figure 10: The preprocessor editing window.

3.2.8 Removing

To remove a acoustic model, select the model in the 'Project models' tree, on the left side of the main window, and either press the 'Remove' button or select the 'Remove' option in the 'Models' menu.


(back to the top)

5 Command Line Interface

All the options available for the ASR-Builder CLI.

OptionBrief description
Command Options
-P / --projectCreate a new project.
-A / --acousticmodelCreate a new acoustic model.
-L / --languagemodelCreate a new language model.
-V / --removemodelRemove an acoustic or language model.
-R / --runscriptRun a script.
-M / --modifyModify an exsiting acoustic model.
-I / --viewinfoDisplay information on the specified model.
-C / --configureChange the HTK configuration for the specified model.
-O / --preprocessorChange the preprocessor configuration or run the preprocessor for a model.
Variable Options
-aSpecifies the folders that contain the audio data or a list of files(training and test).
-cSpecifies a file to save or load for the HTK/preprocessor configuration
-dList of dictionaries.
-eExtension of the audio files.
-fSpecifies the type of information to display.
-gSpecifies the tag/label when modifying a model.
-hDisplays help.
-iIf specified, prevents the program from asking for user input.
-lSpecifies the location to store new data.
-mSpecifies that the audio files are already in MFCC format.
-nSpecifies the model name.
-pSpecifies the project file.
-qSpecify the questions file (for phone tying).
-rSpecifies script file to run.
-sSpecifies the data source for language model creation and modifying a model.
-tSpecifies the folders that contain the transcriptions or a list of files(training and test).
-uSpecifies the HTK tool and the configuration parameters to change for the tool.
-vIf specified, overrides any saved state information.
-wSpecifies the working directory.
-xSpecifies the extension for the transcription files.
-ySpecifies the type of data being added, when modifying a model.

4.1 Projects

4.1.1 Creating

To create a new project specify the -P/--project option. The -P option must be followed by the folder where the project should be created and the name of the project. The -w option, followed by the path of the working folder, can be specified before the -P option. The -w option is optional.

Example:

./asr-builder.py -w /home/user/ -P test/ ASR_Project

The project 'ASR_Project' will be created in '/home/user/test/ASR_Project'.

4.1.2 Opening, Saving and Closing

You cannot open a project directly on the command line, rather ASR-Builder opens, saves and closes the specified project when you run a command.

4.1.3 Removing

To remove/delete a project, one needs only to delete the directory where the project is saved.


(back to the top)

4.2 Acoustic and Language Models

4.2.1 Creating an Acoustic Model

To create a new acoustic model one needs to specify the -A/--acousticmodel option, preceeded by the required variable options. The required options are:
-p: The file for the project in which to create the new acoustic model.
-n: The name of the new acoustic model.
-d: The list of dictionaries for the project (either a list of files separated by spaces and enclosed in double quotation marks or a file that lists the dictionaries).
-q: The questions file for tying phones (only for BIPHONE and TRIPHONE types).
-a: The folders that contain the training and test audio data or files that contain lists of the audio files.
-m: Specifies that the audio files are already in MFCC format.
-t: The folders with the training and test transcriptions or files that contain lists of the transcriptions.
-e: The extension of the audio files, if none is specified ASR-Builder assumes all files in the specifed folders are audio files. [optional]
-x: The extension of the transcription files , if none is specified ASR-Builder assumes all files in the specifed folders are transcription files. [optional]
-w: The working directory. [optional]
If a model with the same name exists the program will terminate with the appropriate error message.

Important: Altough specifying extensions for data is optional the data is expected to be in certain formats. Audio data is expected to be RIFF Wave files, or in HTK MFCC format, if you want to use a different format, then you should change the parameters for HCopy. Transcriptions are expected to be plain text although ASR-Builder can remove punctuation and change the case, if specified.

Example:

./asr-builder.py -w /home/user/ -p test/ASR_Project.proj -n newAM -a data/atrain data/atest -t data/ttrain data/ttest -e wav -x txt -q data/eng_quests.hed -d "data/eng_main.dict data/eng_extra.dict" -A

This will open the 'ASR_Project' and create a model named 'newAM'. Training audio will be added from the '/home/user/data/atrain/' folder and testing audio from the '/home/user/data/atest/' folder. Only files with the '.wav' extension will be added. Transcriptions for training and testing will be added from '/home/user/data/ttrain/' and '/home/user/data/ttest/' respectively. Only transcriptions with the '.txt' extension will be added. The '/home/user/data/eng_quest.hed' file containing questions for phone tying will be added and lastly two dictionaries, '/home/user/data/eng_main.dict' and '/home/user/data/eng_extra.dict' will be added.

4.2.2 Creating a Language Model

To create a new language model one needs to specify the -L/--languagemodel option, preceeded by the required variable options. The required options are:
-p: The file for the project in which to create the new acoustic model.
-n: The name of the new acoustic model.
-s: The data source for the language model.
-w: The workingdirectory (optional).
If a model with the same name exists the program will terminate with the appropriate error message.

Example:

./asr-builder.py -w /home/user/ -p test/ASR_Project.proj -n newLM -s data/grammar.txt -L

This will create a new language model named 'newLM' in the ASR_Project. It will add the file 'grammar.txt' to the model.

4.2.3 Running a script

To train, test or recognise with a model, specify the -R/--runscript option preceeded by the required variable options. For more detailed information on creating scripts, see Section 8.The variable options are:
-p: The file for the project in which to create the new acoustic model.
-n: The name of the new acoustic model.
-r: The name of the run script (either path or key).
-i: Ignore user input (optional).
-v: Override previously saved state information for a script (optional).
-w: The workingdirectory (optional).
Scripts can be specified by either a full path name or a key. The available ASR-Builder run scripts are:

During the process of parsing the script, should ASR-Builder encounter an error it will exit gracefully with an error message.

Example:

./asr-builder.py -w /home/user/ -p test/ASR_Project.proj -n newAM -r AM_TRAIN_MONOPHONE_BASIC -R

This will train the acoustic model named 'newAM' in the 'ASR_Project' project, with a basic monophone training script. You will be prompted to run the script with the given HTK configuration, to prevent the prompt from being displayed, specify the -i option.

4.2.4 Modifying a Model

To modify a model, specify the -R/--runscript option preceeded by the required variable options. The variable options are:
-p: The file for the project in which to create the new acoustic model.
-n: The name of the new acoustic model.
-g: Tag/label for the data.
-s: The location of the data source.
-y: The type of data being added. Acceptable values are FILE, FOLDER AUDIO and TRANS.
-l: The location (folder) to copy the new data to, in the model. Only applicable for FILE and FOLDER types. If the folder/location doesn't exist, it will be creaated. [optiona]
-c: Specifies the case that the text in the transcriptions must be changed to. Available options are u/U (uppercase), l/L (lowercase) and n/N (no change). [optional]
-o: Specifies the punctuation to remove from the text in the transcriptions. The punctuation is specfied as a string of alphanumeric characters (without any spaces). [optional]
-w: The workingdirectory (optional).
The -c and -o options are identical to those in Section 4.2.1.

Example:

./asr-builder.py -w /home/user/ -p test/ASR_Project.proj -n newAM -g NEW_DATA -s data/extra_data/ -y FOLDER -l extra/ -M

This will copy the folder '/home/user/data/extra_data/' to the 'newAM' model in the 'ASR_PRoject' project. The data will be copied to the 'extra' folder and its full path will be '/home/user/ASR_Project/newAM/extra/extra_data/'. The data will be referenced by the tag/key 'NEW_DATA' in the model's file.

4.2.5 Configuring a Model

To configure the HTK parameters for a model through the CLI, you can specify the relevant parameters for a HTK command and their values, using the -C/--configure option. The -C command option takes a single argument. This argument can have one of the following values, 'MODIFY', 'SAVE', or 'LOAD' (case is not important). Also, the -C command option must be preceeded by the following variable options:
-p: The file for the project in which to create the new acoustic model.
-n: The name of the new acoustic model.
-u: The name of the HTK command to change the configuration for and the parameters and their values to add, modify or delete. You can only specify one HTK command at a time but many parameters for that command. The format is as follows: HTK_COMMAND=param_1:value_1:DEL,... ,param_n:value_n:DEL. If DEL is set to 't' or 'T' then the parameter will be deleted from the configuration and you can omit the value. Values may be omitted but do not leave out any the colons, example: HCopy=SOURCERATE::t will delete the parameter SOURCERATE from the HCopy configuration. [Used if 'MODIFY' is specified]
-c: The name of the file to save/load the HTK configuration to/from. [Used if 'SAVE' or 'LOAD' is specified]
-w: The workingdirectory (optional).

The format for the file is (when using 'SAVE' or 'LOAD'), one configuration parameter per line, in the form 'TOOL:PARAMETER:VALUE:DELETE'. The fields are as follows:

Example 1:

./asr-builder.py -w /home/user/ -p test/ASR_Project.proj -n newAM -u HCopy=SOURCERATE:16000:f,NUMCEPS:22:f -C modify

This will add or modify the SOURCERATE and NUMCEPS parameter in the HCopy configuration for the 'newAM' model.

Example 2:

./asr-builder.py -w /home/user/ -p test/ASR_Project.proj -n newAM -j /home/user/htk_full.cfg -C load

This will load the configuration in the file '/home/user/htk_full.cfg' into the 'newAM' model.

4.2.6 Viewing Model Keys and Other Information

To view certain information about a model one needs to specify the -I/--viewinfo option. The -I command option must be preceeded by the following variable options:
-p: The file for the project in which to create the new acoustic model.
-n: The name of the new acoustic model.
-f: The type of information to display. Acceptable options are: a/A (all information), c/C (HTK configuration parameters) and m/M (the model keys/tags).
-w: The workingdirectory (optional).
If there is an error the program will terminate with the appropriate error message.

Example:

./asr-builder.py -w /home/user/ -p test/ASR_Project.proj -n newAM -f A -I

This will display all the information (the model's keys and HTK configuration) for the 'newAM' model from the 'ASR_Project' project.

4.2.7 Preprocessing Options

To load or save preoprocessor configuration options or to run the preprocessor, use the -O/--preprocessor option. The -O command option takes a single argument. This argument can have one of the following values, 'SAVE', 'LOAD' or 'RUN' (case is not important). The -O command option must be preceeded by the following variable options:
-p: The file for the project in which to create the new acoustic model.
-n: The name of the new acoustic model. [Only used with 'RUN']
-c: The name of the file to save/load the preprocessor configuration to/from. [Used if 'SAVE' or 'LOAD' is specified] -w: The workingdirectory (optional).
If there is an error the program will terminate with the appropriate error message.

Example:

./asr-builder.py -w /home/user/ -p test/ASR_Project.proj -n newAM -O RUN

This will run the preprocessor for the 'newAM' model from the 'ASR_Project' project.

4.2.8 Removing

To remove a model one needs to specify the -V/--removemodel option. The -V command option must be preceeded by the following variable options:
-p: The file for the project in which to create the new acoustic model.
-n: The name of the new acoustic model.
-w: The workingdirectory (optional).
If there is an error the program will terminate with the appropriate error message.

Example:

./asr-builder.py -w /home/user/ -p test/ASR_Project.proj -n newAM -V

This will remove the 'newAM' model from the 'ASR_Project' project.


(back to the top)

5. Configuration

Currently there is no configuration of the the ASR-Builder system itself, but there are configuration options for HTK, on a per model basis. To changes these configuration options, see Sections 3.2.5 and 4.2.5.


(back to the top)

6. General Usage

For each new model a file containing default contexts (for use with the 'FILE_FROM_CONTEXT' function, see Section 7.1.3) is created. This file can be found in the 'contexts' folder under the new project and is called 'contexts_default.txt'. You can edit these contexts as needed or create your own file containing new contexts.

The format for the preprocessor configuration file is as follows. So far there are only two parameters, 'REGEX' and 'WORDCHK'. Each parameter is followed by '==' and then the value. The value for 'REGEX' is a comman separate list of regular expression and substitution text pair. Eache pair is separated by a colon. And each regular expression and substitution text value is delimited by a double quote (") at each end, and each regular expresion is preceeded by a 'r'. Example, REGEX==r"\.":"period",r"\,":"comma". The value for 'WORDCHK' can be true/t/false/f (case is not important).


(back to the top)

7. ASR-Builder Run Scripts

ASR-Builder performs training, testing, recognition or any other task using scripts. There are a handful of basic scripts that are provided with ASR-Builder. These are shown in the table below.

Script KeyShort Description
AM_TRAIN_MONOPHONE_BASICBasic training script for a monophone based acoustic model.
AM_TRAIN_MONOPHONE_MFCCTraining script for a monophone based acoustic model, if the audio is already in MFCC format.
AM_TRAIN_BIPHONE_BASICBasic training script for a biphone based acoustic model.
AM_TRAIN_TRIPHONE_BASICBasic training script for a triphone based acoustic model.
AM_TEST_MONOPHONE_BASICBasic testing script for a monophone based acoustic model.
AM_TEST_BIPHONE_BASICBasic testing script for a biphone based acoustic model.
AM_TEST_TRIPHONE_BASICBasic testing script for a triphone based acoustic model.

Users are by no means confined to using the default scripts and can modify the existing scripts or create entirely new ones. The script files have a .rs extension, although this is by no means a steadfast requirement. The commands, and their parameters, that can be used in the scripts are detailed below. The commands can generally be used in any order that is sensible. The test and recognition scripts are separate from the training scripts but if one wants, you can combine these into a single script.


(back to the top)

7.1 Basics

7.1.1 Script State Saving

It is important to note that ASR-Builder has a very basic state saving mechanism when running scripts. When a script is run the system saves the number of the last line in the script to be successfully run and a checksum of all the lines run prior to the last line. So if you run the same script again it will run only from the first line that was not run previously. This mechanism is meant to prevent the need to re-run entire scripts due to syntax/value errors in the scripts.

If there is a change in the script before the last line that was successfully run, in other words the checksum is different, then the entire script will be run again. Also you can override the saved state by providing the '-v' argument on the command-line or ticking the crelevant checkbox in the GUI (see Figure 11 below). In the GUI the information display dialogue, shown just before a script is run, shows the script that will be run and lines that are highlighted red are lines previously run successfully.

Information display dialogue

Figure 11: The information display dialogue, showing the script that will be run.

7.1.2 Script Syntax

The scripting interface for ASR-Builder is very simple but rather powerful. It only has three special operators, namely ++, % and $. The ++ operator allows for the concatenation of strings in a script file. Generally each string is either a folder or file name. When the % operator preceeds a string it indicates that a string is a key value for the model's dictionary and the corresponding lookup is performed. The $ operator preceeds a string that should be concatenated to the previous string to form a single value. In the script files, lines beginning with a # are regarded as comments. The examples below should make it clear how to use the operators.

Example:

The operators can be used in any order to create values. The reason for allowing the use of dictionary keys is to streamline the script file and reduce unnecessary repetion of folder or file names.

The general layout of a command in a script is: COMMAND arg1 ... argN logFile

The 'COMMAND' is always uppercase and may contain underscores. The list of available commands is detailed in sections 6.1.2 and 6.1.3 below. The number of arguments depends on the command and can be zero. The 'logFile' argument is always required and specifies a name for the log file, do not specify a folder just the file name. The syntax for using external python scripts to override internal functions is slightly different and is explained in section 6.2 below.

Important: Currently ASR-Builder does not support line continuation to a new line, in other words a single command and all its arguments must be on a single line (Yes, this does make difficult to read the scripts sometimes).

ASR-Builder also supports nested for-loops. The syntax for a for-loop is as follows:
FOR m TO n IN variable:
command 1
...
command p
END variable

The loop counter runs from n to m - 1. If the variable appears in the code in the loop, it will be substituted by the current value of the loop counter. Code within a loop can be indented to whatever level you like.

Example:

FOR 0 TO 3 IN loop1:
NEXT_MODEL YES reest.log
HEREST -S %SCRIPT_TRAIN -I %MLF_PHONES_TRAIN -H %HMM_CURRENT_MACROS -M %HMM_NEXT %MONOPHONES reest.log
CURRENT_MODEL reest.log
END loop1

This will loop three times (the loop counter will run from 0 to 2) and each time re-estimate the current model and create a new one.

Available for the HTK functions is a method to capture output from these functions. The method uses regular expressions to get the required output, the output is then stored temporarily and can be accessed through the use of keywords, in the same way other information is accessed in the ASR-Builder scripting language. The captured output is valid for a single run through a script. Do not use keyword names that are already in use by the model as the output associated with these will be ignored. Also you can capture as many different pieces of output as you require. The syntax is fairly simple, the keyword and regular expresion pairs are listed between angular brackets after the HTK command and it parameters have been specified (after the log file). Each pair is separated by a comma, each keyword and regular expression in a pair is separated by a colon and each regular expression is enclosed by single quote marks. The syntax looks as follows:

<KEYWORD:'regular expression'>

An example:

HRESULT ... results.log <CORRECT:'Corr=.*(?= )',ACC:'Acc=.*'>
This will get the percentage correct from the output from HResults and store it under the keyword 'CORRECT' and the accuracy from the output and store it under the keyword 'ACC'.

7.1.3 ASR-Builder Assistive Functions

ADD_KEY key value log_file

Used to add a new entry or modify an existing in the model.

Description of arguments:

NEXT_MODEL overwrite next_hmm log_file

Used to create the folder for next model and increment the necessary entries in the model.

Description of arguments:

CURRENT_MODEL log_file

Used to set current model and update the necessary entries in the model.

Description of arguments:

COPY source dest log_file

Used to copy a file or folder.

Description of arguments:

MKDIR folder log_file

Used to create a folder.

Description of arguments:

CONCAT infile1 infile2 outfile log_file

Used to concatenate two files. The contents of the concatenated file is sorted and duplicate items are removed. Each line is considered an item.

Description of arguments:

GET_FILE_LIST folder ext key log_file

Used to get a list of files and add the list to a key in the model. The files in specified folder are concatenated to form a string, the string is added to the model.

Description of arguments:

FILE_FROM_CONTEXT file_name context subst_type subst_source log_file

Used to create a file using a context provided in another file. The function allows substitution from either a parameter in the script file or from another file. Also any keys will be replaced by their values. Note that the 'subst_type' and 'subst_source' parameters must be used in conjunction with one another.

Description of arguments:

A context is defined as follows:

DEFINE: context_name
KEY = KEY_NAME
FILE = edit_script_file_name
"contents"
"more contents"
"@repeated contents with & values from subst_source"
END: context_name

The context is bounded by 'DEFINE:' and 'END:' statements followed by the context name. The 'KEY =' statement provides a name for the key to be used in the model's dictionary and the 'FILE = ' statement provides the name for the edit script file. The contents of the edit script is given between double quotation marks on multiple lines. A line of content that begins with a '@' will be repeated for every value in the 'subst_source'. Where the value replaces the '&' character in the line of contents. This is useful when a line has to be repeated for each HMM in a list. Also the same syntax used in the ASR-Builder scripts can be used.

Example: Given a context as follow:
DEFINE: mkbi_hed
KEY = EDIT_MKBI_HED
FILE = %DIR_CONFIG++mkbi.hed
"CL %DIR_DEF++biphones.phn"
"@TI T_& {(&+*).transP}"
END: mkbi_hed

The 'subst_source' contains the following:
D
CH
A
T

And '%DIR_CONFIG' contains /home/test/config and '%DIR_DEF' contains /home/test/def

The new file will look as follows:
CL /home/test/def/biphones.phn
TI T_D {(D+*).transP}
TI T_CH {(CH+*).transP}
TI T_A {(A+*).transP}
TI T_T {(T+*).transP}

CREATE_WORDS_LIST file_name folder special log_file

Used to get a list of all the unique words used in a set of transcriptions. The function adds in the words: SENT-START, SENT-END, SILENCE.

Description of arguments:

CREATE_MLF file_name folder log_file

Used to create a MLF for a set of transcriptions.

Description of arguments:

CREATE_LIST_FROM_MLF infile_name outfile_name log_file

Used to create a list of all the unique words/phones used in a MLF.

Description of arguments:

CREATE_SCRIPT file_name folder ext log_file

Used to create a HTK script file, containing a list of files.

Description of arguments:

FIX_HMM_LIST infile_name outfile_name log_file

Used to remove the short pause ('sp') from the initial HMM list if it exists. The function will create another list of HMMs with the short pause ('sp') included. The list of HMMs without the short pause are written to the file specified by 'infile_name'.

Description of arguments:

CREATE_HMM hmm_name target vec_size states fw bk rn log_file

Used to create a prototype HMM.

Description of arguments:

CREATE_MACRO vec_size target log_file

Used to create the macro file for use with HERest. The function will create a macro in the file pointed to by the 'HMM_NEXT_MACROS' key. The file used for input is the 'vFloors' file created by the 'HCompV' function.

Description of arguments:

CREATE_INIT_MMF hmm_list model_name vec_size target log_file

Used to create the initial MMF file containing all the HMMs. The function will create a MMF in the file pointed to by the 'HMM_NEXT_MMF' key. The files used for input are the initial HMM, created by the 'HCompV' function, and the list of HMMs.

Description of arguments:

INSERT_TEE_DEF infile_name outfile_name hmm_name copy_from copy_state log_file

Used to insert a Tee-definition into the given MMF-file. The norm is to create a 'sp' HMM model from the 3rd state of 'sil'.

Description of arguments:

CREATE_FULL_LIST infile_name outfile_name log_file

Used to create a full list of phone combinations found in a dictionary.

Description of arguments:

CREATE_CLUSTER_TREE outfile_name type threshold cmd states trees tiedlist fulllist stats log_file

Used to create the tree for clustering the biphones/triphones when tying.

Description of arguments:

CREATE_FLAT_GRAMMAR infile_name outfile_name log_file

Used to create a simple flat grammar (language model) using a list of HMMs.

Description of arguments:

CREATE_SPECIAL_DICT file_name log_file

Used to create a dictionary with the silence, sent-start and sent-end entries.

Description of arguments:

PRUNE_AUDIO_DATA file_name log_file

Used to remove pruned audio data from the training script. The file should be the most recent MLF. The file names found will be used to create a new training script (will overwrite the existing script).

Description of arguments:

7.1.4 HTK Wrapper Functions

The HTK commands take paramters in exactly the same manner as on the CLI except one can use the ASR-Builder syntax to specify arguments and the last parameter must be the name of a log file.
The HTK commands are as follows:
CLUSTER
HBUILD
HCOMPV
HCOPY
HDMAN
HEREST
HHED
HINIT
HLED
HLIST
HLMCOPY
HRESCORE
HLSTATS
HPARSE
HQUANT
HREST
HRESULTS
HSGEN
HSMOOTH
HVITE
LADAPT
LBUILD
LFOF
LGCOPY
LGLIST
LGPREP
LLINK
LMERGE
LNEWMAP
LNORM
LPLEX
LSUBSET


(back to the top)

7.2 Advanced Options

7.2.1 Overloading Internal Functions

You can override some of the assistive functions in ASR-Builder. In order to override a function, one needs to write the new function in a separate Python script. The new function needs to accept 3 variables, a dictionary, a list and a string in that order. An example of a function header is given below:

def myFunction(myDict, myList, myString):

The dictionary variable is the model's dictionary, so that the overloaded function has access to the contents of the model. The list variable is a list of the parameters that were passed to the function in the script and the string is the name of the log file (given in the script). ASR_Builder will catch any exception raised by the overloaded function but it is advised to make sure that the exceptions give sufficient error information. The overloaded function is called from the script in the following manner:

[module path] COMMAND arg1 ... argN log_file

The 'module' is the name of the Python script and 'path' is the path to where the script is located. The 'COMMAND' parameter is the name of the command/function that is being overloaded, the 'arg1...argN' parameter are arguments passed to the command and 'log_file' is the name of the log_file. The functions that can be overridden are:

Function NameScript Command
createWordsListCREATE_WORDS_LIST
createMLFCREATE_MLF
createListFromMLFCREATE_LIST_FROM_MLF
createScriptCREATE_SCRIPT
fixHMMList FIX_HMM_LIST
createHMMCREATE_HMM
createMacroCREATE_MACRO
createInitMMFCREATE_INIT_MMF
insertTeeDefINSERT_TEE_DEF
createFullListCREATE_FULL_LIST
createClusterTreeCREATE_CLUSTER_TREE
createFlatGrammar CREATE_FLAT_GRAMMAR
createSpecialDictCREATE_SPECIAL_DICT
pruneAudioDataPRUNE_ADUDIO_DATA

Example:

In order to call this overridden function from a script, use:
[override /home/user/test] CREATE_SCRIPT arg1 arg2 test.log.
Where the function is located in a Python script named 'override.py', in the '/home/user/test' folder.

7.2.2 External Python Scripts

If you wish to create an entirely new Python module, but have access to the model's dictionary, then one needs to create an Python module that meets certain requirements. The module needs to have a new class that needs to accept a SQlite object and a string when its instantiated. The SQlite object contains the model's dictionary, so that the class has access to the contents of the model and the string is the name of the log file given in the script), you need to import SQliteModel from SQlite.py from the ASR-Builder package. ASR_Builder will catch any exception raised by the class but it is advised that your class catches exceptions or at the very least to make sure that the exceptions give sufficient error information.

The syntax to call a method from an external Python module is as follows:

[module function class path] EXTCLASS log_file

The 'module' argument is the name of the Python module, the 'function' argument is the name of the function to call from the class, the 'class' argument is the name of the class and the 'path' argument is the path to where the module is located. The 'EXTCLASS' parameter is the command that informs ASR-Builder to expect an external Python module and 'log_file' is the name of the log_file. The 'function' is the starting point of processing for your module. For example:

Example:

In order to call this external module, use:
[External start Outside /home/user/test] EXTCLASS test.log.
Where the module is located in a file named 'External.py', in the '/home/user/test' folder. The module contains a class called 'Outside' and the class has a function called 'start()'.


(back to the top)