I’m building an HTTP fuzzing API in Python and want to store the HTTP requests and responses for later analysis. When I fuzz an endpoint, most of the requests and responses differ only by a couple of words, some HTML tags, or a bit of JSON content.
For example, fuzzing a URL with non-existent paths generates essentially the same 404 response each time (or one that only says that path X doesn’t exist on the server), so most of the response body is identical across all the responses.
Storing every body in full would make the API memory-inefficient when used with big wordlists.
My question: is there a Python package or a specific data structure for generic content types (even if it doesn’t work with binary data) that I can use to make the API more memory-efficient?
If not, how hard would it be to build such a data structure, if it is feasible at all?
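
To make the idea concrete, here is a rough sketch of the kind of thing I have in mind. It is only one possible direction, not something I know to be the right approach: each body is compressed with the standard library's zlib, using one representative response as a preset dictionary (the zdict argument), so that only the parts that differ from that baseline cost much space. The names ResponseStore and baseline are made up for illustration, and I'm assuming the bodies are available as bytes and that the shared part fits in zlib's 32 KB window.

```python
import zlib


class ResponseStore:
    """Hypothetical store: keeps bodies compressed against a shared baseline."""

    def __init__(self, baseline: bytes):
        # Assumption: `baseline` is a typical response body (e.g. the usual 404 page).
        self.baseline = baseline
        self._blobs = []

    def add(self, body: bytes) -> int:
        """Compress `body` using the baseline as a preset dictionary; return its index."""
        comp = zlib.compressobj(level=9, zdict=self.baseline)
        self._blobs.append(comp.compress(body) + comp.flush())
        return len(self._blobs) - 1

    def get(self, index: int) -> bytes:
        """Recover the original body; the same baseline must be supplied again."""
        decomp = zlib.decompressobj(zdict=self.baseline)
        return decomp.decompress(self._blobs[index]) + decomp.flush()

    def stored_size(self) -> int:
        """Total number of compressed bytes currently held."""
        return sum(len(blob) for blob in self._blobs)


if __name__ == "__main__":
    template = b"<html><body><h1>404 Not Found</h1><p>The path %s does not exist.</p></body></html>"
    store = ResponseStore(baseline=template % b"/")
    bodies = [template % path for path in (b"/admin", b"/backup", b"/old-login")]
    for body in bodies:
        store.add(body)
    print(sum(len(b) for b in bodies), "raw bytes ->", store.stored_size(), "stored bytes")
```

A delta-based scheme (storing only a diff against the baseline, for instance with difflib) would be another option, but I don't know which of these holds up better with large wordlists.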