Tlextractor
TLExtractor is a Python script that extracts data from tldraw pages, saves it as JSON, and processes images. It supports both the standard and custom submission templates, and uses depth-first search, async programming, multi-threading, and multi-processing.
Its main use is extracting data for specific students: students submit their projects through a template, and the script reads that template to pull out the necessary data.
- Depth-First Search: algorithm used to search through a tree-like data structure, starting from the deepest nodes and working up.
- Async programming: creates one coroutine object per page and runs them all concurrently with minimal overhead.
- Multi-Processing: processes multiple images in true parallel.
- Playwright: used as the website scraper.
- Multi-Threading: starts a thread for each page and simulates a loading screen.
- Python: the main programming language.
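As a rough illustration of how the depth-first search and per-page coroutines could fit together, here is a minimal sketch. It is not TLExtractor's actual code: the page tree shape (`text` / `children`), the sample data, and the function names `dfs_postorder`, `extract_page`, and `extract_all` are all assumptions made for this example, and the "deepest first" behaviour is interpreted here as a post-order traversal.

```python
import asyncio

# Hypothetical page trees: each node has a "text" and optional "children".
# This shape is an assumption for illustration, not tldraw's real schema.
SAMPLE_PAGES = {
    "benchmark 01": {
        "text": "frame",
        "children": [
            {"text": "group", "children": [{"text": "person1"}, {"text": "person2"}]},
            {"text": "person3"},
        ],
    },
    "benchmark 02": {"text": "frame", "children": [{"text": "person4"}]},
}


def dfs_postorder(node):
    """Depth-first, post-order: children are visited before their parent."""
    visited = []
    for child in node.get("children", []):
        visited.extend(dfs_postorder(child))
    visited.append(node["text"])
    return visited


async def extract_page(name, tree):
    """One coroutine per page; the sleep stands in for real I/O such as a page load."""
    await asyncio.sleep(0)
    return name, dfs_postorder(tree)


async def extract_all(pages):
    """Create one coroutine per page and run them all concurrently."""
    results = await asyncio.gather(
        *(extract_page(name, tree) for name, tree in pages.items())
    )
    return dict(results)


if __name__ == "__main__":
    print(asyncio.run(extract_all(SAMPLE_PAGES)))
```

`asyncio.gather` returns results in the order the coroutines were passed in, so the output keeps the page order even though the pages run concurrently.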
Example
#=> JSON Data Structure
{
"project title": "CORE STUDIO 02-24-TEST",
"data": [
{
"page": "benchmark 01",
"date": "DUE 26 MAY (SUNDAY) 2359",
"description": "First iteration of site in blender/rhino",
"students": [
"person1",
"person2",
"person3"
]
}
]
}
Check it out on GitHub/Tlextractor for more information.
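For readers who want to consume the output, the following sketch shows one way to read a file with the JSON structure from the example and list the students on each page. The helper name `students_by_page` is hypothetical, and the sample string only mirrors the fields shown in the example. It is written as strict JSON, with no trailing comma after the last list item, because `json.loads` rejects one.

```python
import json

# Sample mirroring the example structure; the values are illustrative only.
SAMPLE = """
{
  "project title": "CORE STUDIO 02-24-TEST",
  "data": [
    {
      "page": "benchmark 01",
      "date": "DUE 26 MAY (SUNDAY) 2359",
      "description": "First iteration of site in blender/rhino",
      "students": ["person1", "person2", "person3"]
    }
  ]
}
"""


def students_by_page(raw):
    """Map each page name to its list of students (hypothetical helper)."""
    doc = json.loads(raw)
    return {entry["page"]: entry["students"] for entry in doc["data"]}


if __name__ == "__main__":
    for page, students in students_by_page(SAMPLE).items():
        print(page, students)
```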