Application of Multi-label classification to HAZOP texts

Main idea: use ML methods to build a model that makes predictions from the text description of a hazardous event in a HAZOP worksheet.

HAZOP (Hazard and Operability Study) is a common method for identifying hazards in the operation of existing facilities and in the design of new ones. The procedure is a sequential, routine analysis based on brainstorming. As a result, a large amount of information is recorded in text form during a HAZOP study, and this text can later be used to train various models.

The goal of this project is to use machine learning methods (deep learning in particular) to develop a model to predict the expected level of severity of consequences based on a text description of a hazardous event in HAZOP worksheets.

Multi-label text classification was implemented for 5 categories. A multi-label setup was chosen because of the specifics of the task and because a single text can belong to several categories at once.
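To illustrate what "multi-label" means for the targets, here is a minimal sketch of encoding the labels of one text as a 0/1 vector over 5 categories. The category names `cat_0` to `cat_4` and the helper `to_multi_hot` are hypothetical; the README does not name the categories or describe how the notebook encodes them.

```python
# Hypothetical category names: the README only states that there are 5 categories.
CATEGORIES = ["cat_0", "cat_1", "cat_2", "cat_3", "cat_4"]


def to_multi_hot(labels, categories=CATEGORIES):
    """Encode a collection of category names as a 0/1 vector, one slot per category."""
    unknown = set(labels) - set(categories)
    if unknown:
        raise ValueError(f"unknown categories: {sorted(unknown)}")
    return [1 if c in labels else 0 for c in categories]


# One text may carry several labels at once, or none at all.
print(to_multi_hot({"cat_0", "cat_3"}))  # [1, 0, 0, 1, 0]
print(to_multi_hot(set()))               # [0, 0, 0, 0, 0]
```

Unlike single-label (multi-class) targets, several positions of the vector can be 1 for the same text.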

The original dataset includes a mix of data from various HAZOP procedures.

For English-language data, BERT was used (https://huggingface.co/bert-base-cased).
For Russian-language data, ruBERT was used (https://huggingface.co/sberbank-ai/ruBert-base).
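A possible way to select the checkpoint per language and tokenize a batch of texts is sketched below. The checkpoint names come from the URLs above; the language keys, the helper names `checkpoint_for` and `tokenize`, and `max_length=128` are assumptions for illustration, not values taken from the notebook. The sketch relies on the Hugging Face `transformers` package.

```python
# Checkpoint names are taken from the URLs above; the "en"/"ru" keys are illustrative.
CHECKPOINTS = {
    "en": "bert-base-cased",
    "ru": "sberbank-ai/ruBert-base",
}


def checkpoint_for(lang):
    """Return the checkpoint name configured for a language code."""
    try:
        return CHECKPOINTS[lang]
    except KeyError:
        raise ValueError(f"no checkpoint configured for language {lang!r}") from None


def tokenize(texts, lang, max_length=128):
    """Tokenize a list of texts; max_length=128 is an assumed value."""
    # Imported lazily so the mapping above works without transformers installed.
    from transformers import AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(checkpoint_for(lang))
    # Pad to the longest text in the batch and cut off anything beyond max_length.
    return tokenizer(texts, padding=True, truncation=True, max_length=max_length)
```

Calling `tokenize(["Loss of cooling water flow"], "en")` would download the tokenizer on first use and return a mapping that includes `input_ids` and `attention_mask`.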

Main steps:

Data collection (previously done)
EDA
Data preprocessing
Text tokenization
NN for multi-label classification
Model training
Validation & tests
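The steps "NN for multi-label classification" and "Model training" can be illustrated with a framework-free sketch. A common formulation gives each category its own sigmoid output and trains with binary cross-entropy; whether the notebook uses exactly this head and loss is an assumption, and the logits below are made-up numbers rather than model outputs.

```python
import math


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def bce_with_logits(logits, targets):
    """Mean binary cross-entropy over the labels, computed from raw logits."""
    total = 0.0
    for z, y in zip(logits, targets):
        # Stable form of -[y*log(s) + (1-y)*log(1-s)] with s = sigmoid(z).
        total += max(z, 0.0) - z * y + math.log1p(math.exp(-abs(z)))
    return total / len(logits)


def predict(logits, threshold=0.5):
    """Turn logits into independent 0/1 decisions, one per category."""
    return [1 if sigmoid(z) >= threshold else 0 for z in logits]


logits = [2.0, -1.5, 0.3, -3.0, 1.2]  # made-up outputs for one text, 5 categories
targets = [1, 0, 0, 0, 1]             # made-up multi-hot target

print(predict(logits))  # [1, 0, 1, 0, 1]
print(bce_with_logits(logits, targets))
```

Because each category is decided independently, a text can be assigned zero, one, or several categories, which is what distinguishes this setup from a softmax classifier.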

Main results:

85% average accuracy (i.e., the proportion of correctly labeled objects) for both English- and Russian-language data
65% average accuracy on a test object (on new data)
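In a multi-label setting, "proportion of correctly labeled objects" can be read in more than one way. The sketch below shows two common readings: exact-match accuracy, where the whole label vector must be right, and per-label accuracy, where each 0/1 decision is counted separately. Which one the reported figures use is not stated, so both functions and the toy data are illustrative assumptions.

```python
def exact_match_accuracy(y_true, y_pred):
    """Share of objects whose full label vector is predicted correctly."""
    hits = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return hits / len(y_true)


def per_label_accuracy(y_true, y_pred):
    """Share of individual 0/1 label decisions that are correct."""
    correct = sum(ti == pi for t, p in zip(y_true, y_pred) for ti, pi in zip(t, p))
    total = sum(len(t) for t in y_true)
    return correct / total


# Toy data: 4 texts, 5 categories each (not taken from the HAZOP dataset).
y_true = [[1, 0, 0, 1, 0], [0, 1, 0, 0, 0], [0, 0, 1, 0, 1], [1, 0, 0, 0, 0]]
y_pred = [[1, 0, 0, 1, 0], [0, 1, 1, 0, 0], [0, 0, 1, 0, 1], [0, 0, 0, 0, 0]]

print(exact_match_accuracy(y_true, y_pred))  # 0.5
print(per_label_accuracy(y_true, y_pred))    # 0.9
```

The two metrics can differ considerably on the same predictions, so it helps to state which one a reported percentage refers to.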

The notebook hazop-mlc-eng-data.ipynb contains only the code for English-language data.

AntonWeaver/hazop_mlc
