ITEA is the Eureka Cluster on software innovation
ITEA is the Eureka Cluster on software innovation
ITEA 4 page header azure circular

Encrypted Traffic Classification Datasets

Project
20020 ENTA
Type
New service
Description

Eleven application detection datasets have been created for use in training and testing AI-based encrypted network traffic classification models.

Contact
Nur Zincir-Heywood
Email
nzincirh@dal.ca
Research area(s)
Encrypted Traffic Classification, Activity Detection, Traffic Analysis
Technical features

File Naming Convention: All files under the name {IMA-name}_encrypted_traffic.pcap contain the encrypted traffic resulting from the corresponding IMA. Similarly, all files under the name {IMA-name}_encrypted_traffic_flows.txt contain the text-based description of flows we used to build our models. No timestamps are added as they are already contained in the .pcap files.

They can be used to train/test/evaluate models: 1 Mobile Instant Messaging text chat traffic (publicly available at IEEE DataPort https://ieee-dataport.org/documents/encrypted-mobile-instant-messaging-traffic-dataset; 1 text-based Instant messaging group chat; 1 VoIP-based Instant messaging group chat; 2 datasets with streaming audio and video traffic and OTT applications; 3 datasets with Instant Messaging applications – wireline and mobile; 1 datasets with Google Applications; 1 dataset on social media captured using bots; 1 dataset with mix of streaming video, social media, interactive live stream.

Integration constraints

The data can be used for model training for Machine Learning or Deep Learning experiments.

Targeted customer(s)

Researchers in the area of Network Traffic Analysis. Researchers can be from Academia or Industry.

Conditions for reuse

The terms of reuse is dictated by IEEE Dataport licensing agreement.

Confidentiality
Public
Publication date
16-12-2024
Involved partners
Solana Networks (CAN)
Dalhousie University (CAN)
Karel Electronics (TUR)