Return to search results
Anomaly Detection in Sequences
We present a set of novel algorithms which we call sequenceMiner, that detect and characterize anomalies in large sets of high-dimensional symbol sequences that arise from recordings of switch sensors in the cockpits of commercial airliners. While the algorithms we present are general and domain-independent, we focus on a specific
problem that is critical to determining system-wide health
of a fleet of aircraft. The approach taken uses unsupervised clustering of sequences using the normalized length of he longest common subsequence (nLCS) as a similarity measure, followed by a detailed analysis of outliers to detect anomalies. In this method, an outlier sequence is defined as a sequence that is far away from a cluster. We present new algorithms for outlier analysis that provide comprehensible indicators as to why a particular sequence is deemed to be an outlier. The algorithm provides a coherent description to an analyst of the anomalies in the sequence when compared to more normal sequences.
The final section of the paper demonstrates the effectiveness of sequenceMiner for anomaly detection on a real set of discrete sequence data from a fleet of commercial airliners.
We show that sequenceMiner discovers actionable and
operationally significant safety events. We also compare
our innovations with standard HiddenMarkov Models, and
show that our methods are superior
Complete Metadata
| @type | dcat:Dataset |
|---|---|
| accessLevel | public |
| accrualPeriodicity | irregular |
| bureauCode |
[
"026:00"
]
|
| contactPoint |
{
"fn": "Ashok Srivastava",
"@type": "vcard:Contact",
"hasEmail": "mailto:ashok.n.srivastava@gmail.com"
}
|
| description | We present a set of novel algorithms which we call sequenceMiner, that detect and characterize anomalies in large sets of high-dimensional symbol sequences that arise from recordings of switch sensors in the cockpits of commercial airliners. While the algorithms we present are general and domain-independent, we focus on a specific problem that is critical to determining system-wide health of a fleet of aircraft. The approach taken uses unsupervised clustering of sequences using the normalized length of he longest common subsequence (nLCS) as a similarity measure, followed by a detailed analysis of outliers to detect anomalies. In this method, an outlier sequence is defined as a sequence that is far away from a cluster. We present new algorithms for outlier analysis that provide comprehensible indicators as to why a particular sequence is deemed to be an outlier. The algorithm provides a coherent description to an analyst of the anomalies in the sequence when compared to more normal sequences. The final section of the paper demonstrates the effectiveness of sequenceMiner for anomaly detection on a real set of discrete sequence data from a fleet of commercial airliners. We show that sequenceMiner discovers actionable and operationally significant safety events. We also compare our innovations with standard HiddenMarkov Models, and show that our methods are superior |
| distribution |
[
{
"@type": "dcat:Distribution",
"title": "Anomaly_Detection-2008.pdf",
"format": "PDF",
"mediaType": "application/pdf",
"description": "Anomaly_Detection-2008.pdf",
"downloadURL": "https://c3.nasa.gov/dashlink/static/media/algorithm/Anomaly_Detection-2008.pdf"
},
{
"@type": "dcat:Distribution",
"title": "Anomaly_Detection_in_Sequences.pdf",
"format": "PDF",
"mediaType": "application/pdf",
"description": "Anomaly_Detection_in_Sequences.pdf",
"downloadURL": "https://c3.nasa.gov/dashlink/static/media/algorithm/Anomaly_Detection_in_Sequences.pdf"
},
{
"@type": "dcat:Distribution",
"title": "DataMiningResults-ARMDSeminar.pdf",
"format": "PDF",
"mediaType": "application/pdf",
"description": "DataMiningResults-ARMDSeminar.pdf",
"downloadURL": "https://c3.nasa.gov/dashlink/static/media/algorithm/DataMiningResults-ARMDSeminar.pdf"
},
{
"@type": "dcat:Distribution",
"title": "Data_Mining_for_ISHM.pdf",
"format": "PDF",
"mediaType": "application/pdf",
"description": "Data_Mining_for_ISHM.pdf",
"downloadURL": "https://c3.nasa.gov/dashlink/static/media/algorithm/Data_Mining_for_ISHM.pdf"
},
{
"@type": "dcat:Distribution",
"title": "sequences.pdf",
"format": "PDF",
"mediaType": "application/pdf",
"description": "Discovering Atypical Flights in Sequences of Discrete Flight Parameters",
"downloadURL": "https://c3.nasa.gov/dashlink/static/media/algorithm/sequences.pdf"
}
]
|
| identifier | DASHLINK_3 |
| issued | 2010-09-09 |
| keyword |
[
"ames",
"dashlink",
"nasa"
]
|
| landingPage | https://c3.nasa.gov/dashlink/resources/3/ |
| modified | 2025-03-31 |
| programCode |
[
"026:029"
]
|
| publisher |
{
"name": "Dashlink",
"@type": "org:Organization"
}
|
| title | Anomaly Detection in Sequences |