
Encrypted Traffic Classification Datasets
- Project
- 20020 ENTA
- Type
- New service
- Description
Eleven application detection datasets have been created for use in training and testing AI-based encrypted network traffic classification models.
- Contact
- Nur Zincir-Heywood
- nzincirh@dal.ca
- Research area(s)
- Encrypted Traffic Classification, Activity Detection, Traffic Analysis
- Technical features
File Naming Convention: All files under the name {IMA-name}_encrypted_traffic.pcap contain the encrypted traffic resulting from the corresponding IMA. Similarly, all files under the name {IMA-name}_encrypted_traffic_flows.txt contain the text-based description of flows we used to build our models. No timestamps are added as they are already contained in the .pcap files.
They can be used to train/test/evaluate models: 1 Mobile Instant Messaging text chat traffic (publicly available at IEEE DataPort https://ieee-dataport.org/documents/encrypted-mobile-instant-messaging-traffic-dataset; 1 text-based Instant messaging group chat; 1 VoIP-based Instant messaging group chat; 2 datasets with streaming audio and video traffic and OTT applications; 3 datasets with Instant Messaging applications – wireline and mobile; 1 datasets with Google Applications; 1 dataset on social media captured using bots; 1 dataset with mix of streaming video, social media, interactive live stream.
- Integration constraints
The data can be used for model training for Machine Learning or Deep Learning experiments.
- Targeted customer(s)
Researchers in the area of Network Traffic Analysis. Researchers can be from Academia or Industry.
- Conditions for reuse
The terms of reuse is dictated by IEEE Dataport licensing agreement.
- Confidentiality
- Public
- Publication date
- 16-12-2024
- Involved partners
- Solana Networks (CAN)
- Dalhousie University (CAN)
- Karel Electronics (TUR)