Return to search results
MULTI-LABEL ASRS DATASET CLASSIFICATION USING SEMI-SUPERVISED SUBSPACE CLUSTERING
MULTI-LABEL ASRS DATASET CLASSIFICATION USING SEMI-SUPERVISED
SUBSPACE CLUSTERING
MOHAMMAD SALIM AHMED, LATIFUR KHAN, NIKUNJ OZA, AND MANDAVA RAJESWARI
Abstract. There has been a lot of research targeting text classification. Many of them focus
on a particular characteristic of text data - multi-labelity. This arises due to the fact that a
document may be associated with multiple classes at the same time. The consequence of such a
characteristic is the low performance of traditional binary or multi-class classification techniques on
multi-label text data. In this paper, we propose a text classification technique that considers this
characteristic and provides very good performance. Our multi-label text classification approach
is an extension of our previously formulated [3] multi-class text classification approach called
SISC (Semi-supervised Impurity based Subspace Clustering). We call this new classification model
as SISC-ML(SISC Multi-Label). Empirical evaluation on real world multi-label NASA ASRS
(Aviation Safety Reporting System) data set reveals that our approach outperforms state-of-theart
text classification as well as subspace clustering algorithms.
Complete Metadata
| @type | dcat:Dataset |
|---|---|
| accessLevel | public |
| accrualPeriodicity | irregular |
| bureauCode |
[
"026:00"
]
|
| contactPoint |
{
"fn": "Elizabeth Foughty",
"@type": "vcard:Contact",
"hasEmail": "mailto:elizabeth.a.foughty@nasa.gov"
}
|
| description | MULTI-LABEL ASRS DATASET CLASSIFICATION USING SEMI-SUPERVISED SUBSPACE CLUSTERING MOHAMMAD SALIM AHMED, LATIFUR KHAN, NIKUNJ OZA, AND MANDAVA RAJESWARI Abstract. There has been a lot of research targeting text classification. Many of them focus on a particular characteristic of text data - multi-labelity. This arises due to the fact that a document may be associated with multiple classes at the same time. The consequence of such a characteristic is the low performance of traditional binary or multi-class classification techniques on multi-label text data. In this paper, we propose a text classification technique that considers this characteristic and provides very good performance. Our multi-label text classification approach is an extension of our previously formulated [3] multi-class text classification approach called SISC (Semi-supervised Impurity based Subspace Clustering). We call this new classification model as SISC-ML(SISC Multi-Label). Empirical evaluation on real world multi-label NASA ASRS (Aviation Safety Reporting System) data set reveals that our approach outperforms state-of-theart text classification as well as subspace clustering algorithms. |
| distribution |
[
{
"@type": "dcat:Distribution",
"title": "Paper 22 .pdf",
"format": "PDF",
"mediaType": "application/pdf",
"description": "MULTI-LABEL ASRS DATASET CLASSIFICATION USING SEMI-SUPERVISED SUBSPACE CLUSTERING",
"downloadURL": "https://c3.nasa.gov/dashlink/static/media/publication/Paper_22_.pdf"
}
]
|
| identifier | DASHLINK_244 |
| issued | 2010-10-13 |
| keyword |
[
"ames",
"dashlink",
"nasa"
]
|
| landingPage | https://c3.nasa.gov/dashlink/resources/244/ |
| modified | 2025-03-31 |
| programCode |
[
"026:029"
]
|
| publisher |
{
"name": "Dashlink",
"@type": "org:Organization"
}
|
| title | MULTI-LABEL ASRS DATASET CLASSIFICATION USING SEMI-SUPERVISED SUBSPACE CLUSTERING |