ViTexOCR; a script to extract text overlays from digital video

Published by U.S. Geological Survey | Department of the Interior | Metadata Last Checked: March 03, 2026 | Last Modified: 2020-10-19T00:00:00Z

The ViTexOCR script presents a new method for extracting navigation data from videos with text overlays using optical character recognition (OCR) software. Over the past few decades, it was common for videos recorded during surveys to be overlaid with real-time geographic positioning satellite chyrons including latitude, longitude, date and time, as well as other ancillary data (such as speed, heading, or user input identifying fields). Embedding these data into videos provides them with utility and accuracy, but using the location data for other purposes, such as analysis in a geographic information system, is not possible when only available on the video display. Extracting the text data from imagery using software allows these videos to be located and analyzed in a geospatial context. The script allows a user to select a video, specify the text data types (e.g. latitude, longitude, date, time, or other), text color, and the pixel locations of overlay text data on a sample video frame. The script’s output is a data file containing the retrieved geospatial and temporal data. All functionality is bundled in a Python script that incorporates a graphical user interface and several other software dependencies.

Resources

2 resources available

Digital Data

XML

Visit Page
Original Metadata

XML

Download

Find Related Datasets

Search by Tags

Click any tag below to search for similar datasets

Complete Metadata

accessLevel	public
bureauCode	[ "010:12" ]
contactPoint	{ "fn": "Evan T. Dailey", "@type": "vcard:Contact", "hasEmail": "mailto:edailey@usgs.gov" }
description	The ViTexOCR script presents a new method for extracting navigation data from videos with text overlays using optical character recognition (OCR) software. Over the past few decades, it was common for videos recorded during surveys to be overlaid with real-time geographic positioning satellite chyrons including latitude, longitude, date and time, as well as other ancillary data (such as speed, heading, or user input identifying fields). Embedding these data into videos provides them with utility and accuracy, but using the location data for other purposes, such as analysis in a geographic information system, is not possible when only available on the video display. Extracting the text data from imagery using software allows these videos to be located and analyzed in a geospatial context. The script allows a user to select a video, specify the text data types (e.g. latitude, longitude, date, time, or other), text color, and the pixel locations of overlay text data on a sample video frame. The script’s output is a data file containing the retrieved geospatial and temporal data. All functionality is bundled in a Python script that incorporates a graphical user interface and several other software dependencies.
distribution	[ { "@type": "dcat:Distribution", "title": "Digital Data", "format": "XML", "accessURL": "https://doi.org/10.5066/F7833Q56", "mediaType": "application/http", "description": "Landing page for access to the data" }, { "@type": "dcat:Distribution", "title": "Original Metadata", "format": "XML", "mediaType": "text/xml", "description": "The metadata original format", "downloadURL": "https://data.usgs.gov/datacatalog/metadata/USGS.58dd56ace4b02ff32c685954.xml" } ]
identifier	http://datainventory.doi.gov/id/dataset/USGS_58dd56ace4b02ff32c685954
keyword	[ "CMGP", "Coastal and Marine Geology Program", "PCMSC", "Pacific Coastal and Marine Science Center", "U.S. Geological Survey", "USGS", "USGS:58dd56ace4b02ff32c685954", "computer science", "scientific software", "software development" ]
modified	2020-10-19T00:00:00Z
publisher	{ "name": "U.S. Geological Survey", "@type": "org:Organization" }
spatial	-180.0, -90.0, 180.0, 90.0
theme	[ "geospatial" ]
title	ViTexOCR; a script to extract text overlays from digital video