Relative Content

Tag Archive for compression

Arithmetic Coding Questions

I have been reading up on arithmetic coding and, while I understand how it works, all the guides and instructions I’ve read start with something like:

Fast compression in C++ and decompression in C#

Overview I am working on a client-server application. The client is written in C++ (working on Windows, planning to support Linux) and the server is a .NET RESTful service. I need to HTTP POST some data to the server. The data is a unicode string that I need to compress using any fast compression algorithm […]

Decompress a .csv.gz file

I’m trying to decompress multiple .cgv.gz files and I can’t seem to find somewhere where it’s explained clearly.

How should I handle different hashes of identical files in .zip archive with different ‘last changed’ date?

We store zipped files in the storage of a cloud provider which contain certain fields (metadata). These files are derived from other, larger files. Every time we (re)generate these files, their ‘last changed’ date is set to the generation time, while the content of the file is identical. When we recreate one of these files, which have previously been stored in the online storage, their file hashes (md5/sha) differ. The reason for that is that the zip algorithm seems to include the ‘last changed’ information in the .zip file.