NOVA: Multi-Modal Video Annotation System
Overview
NOVA (Nursing Operational View of Actions) is a web-based video annotation framework designed for multi-modal data analysis. Version 0.2 implements a synchronized playback system for multiple high-fidelity video streams with integrated data overlays, supporting gaze tracking, heatmaps, and IMU sensor visualization.
Key Features
Multi-Modal Synchronization
- Multiple Video Streams: Load and synchronize multiple video files simultaneously
- Data Overlays: Overlay gaze tracking, heatmaps, and IMU data directly on video
- Time-Synchronized Playback: Frame-accurate synchronization across all modalities
Annotation System
- Layer-Based Annotations: Organize annotations into customizable layers
- Keyboard Shortcuts: Rapid annotation with keyboard-driven workflow
- Export Support: Export annotations in standard formats
Data Support
- Gaze Tracking: Native support for Tobii Pro Glasses 3 data
- IMU Visualization: 5-second rolling window waveform display
- Gzip Support: Direct client-side decompression of .gz files
Technical Architecture
Client-Side Processing
NOVA is a fully client-side application requiring no server infrastructure:
- Pure JavaScript: No frameworks, maximum compatibility
- Local File Processing: Videos and data stay on your machine
- Responsive Design: Adaptive layout for various screen sizes
Canvas Overlay System
The overlay system uses a four-tier canvas sizing strategy:
- Base Layer: Video element with
object-fit: contain - Overlay Canvas: Positioned absolutely over video area
- Letterbox Detection: Automatic calculation of video display bounds
- Coordinate Mapping: Accurate translation from data coordinates to display pixels
Use Cases
NOVA is designed for:
- Nursing Education Research: Analyze student performance during simulations
- Eye-Tracking Studies: Visualize gaze patterns over procedural videos
- Behavioral Analysis: Annotate and categorize actions in video recordings
- Multi-Sensor Integration: Combine video with IMU and physiological data
Quick Start
- Open NOVA: Click the launch button above or navigate to the tool
- Add Video: Click βAdd Dataβ and select your video file(s)
- Add Overlays: Load gaze or IMU data files
- Annotate: Select annotation types and use keyboard shortcuts (S/E) to mark segments
- Export: Save your annotations for analysis
Keyboard Shortcuts
| Key | Action |
|---|---|
Space |
Play/Pause |
β / β |
Previous/Next Frame |
S |
Start Annotation |
E |
End Annotation |
Supported Formats
Video
- MP4, WebM, MOV, AVI
Data Files
- CSV, TSV (gaze coordinates)
- JSON Lines (Tobii Pro Glasses 3)
- Gzip compressed files (.gz)
Technology Stack
- Frontend: Vanilla JavaScript, CSS3
- Decompression: Pako for .gz support
- Fonts: Inter (Google Fonts)
- No Dependencies: Fully self-contained
Future Development
Planned enhancements include:
- Additional sensor modality support
- Collaborative annotation features
- Machine learning-assisted annotation suggestions
- Extended export formats
Authors
- Hanchen David Wang - Vanderbilt University