On April 17, 2022 a data leak allegedly belonging to a video-focused social networking service TikTok started spreading on underground hacking platforms. It consisted of JSON and SQL files totalling 26GB.
Kaduu Team has analysed files in this “leak”. The dataset is just metadata for 32,489,068 TikTok videos, scraped between 2020-07-22 and 2020-10-13, meaning, it does not bear sensitive information.
The both JSON and SQL files represent the same data. As hacker describes: “Everything in the JSON file is unaltered response from TikTok, the MySQL database is a bit more trimmed down.”
In addition to the videos, there is metadata on:
- 12,382,540 sounds
- 2,533,869 challenges (hashtags)
- 218,479 authors (video creators)