For text, I found a couple of messages on the wiki that describe how text is sent to the player when a scene is run:

"Just to add, if you want the text, it is stored contiguously in a response packet. After you receive the image of the H-scene, you'll send a packet; in its response tab will be some garbage (based on what is present, it's probably the setting + names of what's necessary to load the text + overlay + click events). After that, you'll send another packet. In its response will be the text for all, or most, of the characters."
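
If the text really does sit contiguously in that second response, one way to pull it out is to save the response body from the browser's network tab and scan it for long runs of readable characters. Here is a rough sketch of that idea. The function name `extract_text_runs` and the `min_len` threshold are made up for illustration, and the UTF-8 encoding is purely a guess on my part, since the quoted message doesn't say how the text is encoded.

```python
import re


def extract_text_runs(body: bytes, min_len: int = 8) -> list[str]:
    """Return contiguous runs of readable text found in a raw response body.

    Assumption: the text is UTF-8. Bytes that fail to decode become U+FFFD
    and, like control characters, end the current run.
    """
    decoded = body.decode("utf-8", errors="replace")
    # Tab, newline and carriage return are allowed inside a run;
    # other control characters and U+FFFD act as separators.
    pattern = re.compile(
        r"[^\x00-\x08\x0b\x0c\x0e-\x1f\x7f\ufffd]{%d,}" % min_len
    )
    return [match.group(0) for match in pattern.finditer(decoded)]


# Toy input: binary junk, one long text run, more junk, one short run.
sample = (
    b"\x00\x01\x02"
    + "Hello there, traveler.".encode("utf-8")
    + b"\xff\xfe\x00"
    + b"ok"
    + b"\x00"
)
runs = extract_text_runs(sample)
```

On the toy input only the long run survives, because "ok" is shorter than `min_len`. On a real capture you would probably have to tune `min_len`, and swap the codec if the output looks like mojibake.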