If there is a huge file with millions of lines and all lines have numbers with tab separation and no line has more than 15 numbers, let say all number are between 10000 to 35000, what tools/programming language should I use to find out the the most significant range in the numbers e.g. 80% of the numbers are between 18000 to 27000 or 40% of the numbers are between 18000 to 27000 ? Can you suggest some thing ?
I have tried python with numpy but it does work, I can find out how many numbers are odd and how many are even but I can’t find the significant range in the file. And finding the numbers if the are Odd and Even takes a lot of time too.