Where I work, we've been debating this a lot. I work with log data from CDN...

unilynx · on April 30, 2018

Wouldn't storing the first three octets of an IP address be enough for this kind of analysis? Or use the whois database and reduce the data to the first IP address of the network?
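For illustration, keeping only the first three octets amounts to zeroing the host bits of a /24. A minimal sketch using Python's standard `ipaddress` module; the function name `truncate_ipv4` and the sample addresses (documentation ranges) are made up for this example, and it only handles IPv4:

```python
import ipaddress

def truncate_ipv4(addr: str, prefix_len: int = 24) -> str:
    """Zero the host bits of an IPv4 address, keeping the first prefix_len bits.

    Hypothetical helper for illustration; prefix_len=24 keeps three octets.
    """
    # strict=False lets ip_network accept an address with host bits set
    # and masks them off instead of raising ValueError.
    network = ipaddress.ip_network(f"{addr}/{prefix_len}", strict=False)
    return str(network.network_address)

print(truncate_ipv4("203.0.113.77"))        # 203.0.113.0
print(truncate_ipv4("198.51.100.200", 16))  # 198.51.0.0
```

The whois variant would be the same idea with the prefix length taken from the registered network instead of a fixed 24.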

ryanworl · on April 30, 2018

I personally think just storing the autonomous system the IP originates from, and never writing the IPs to disk at all, would be advisable if the goal is purely to measure which ISPs are delivering how many bytes to end users. Another benefit is that the AS-to-IP mapping database is small enough to fit in memory without issue.
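A rough sketch of that idea, assuming a prefix-to-ASN table already loaded into memory. The table contents, the names `PREFIX_TO_ASN`, `asn_for`, and `bytes_per_asn`, and the record format are all hypothetical; the prefixes and AS numbers come from the ranges reserved for documentation. A real table would be far larger and would want a trie or sorted-range lookup rather than a linear scan:

```python
import ipaddress
from collections import Counter

# Hypothetical in-memory table: announced prefix -> origin AS number.
PREFIX_TO_ASN = {
    ipaddress.ip_network("192.0.2.0/24"): 64496,
    ipaddress.ip_network("192.0.2.128/25"): 64497,  # more specific route
    ipaddress.ip_network("203.0.113.0/24"): 64498,
}

def asn_for(addr: str):
    """Return the ASN of the longest matching prefix, or None if unmatched."""
    ip = ipaddress.ip_address(addr)
    best_net, best_asn = None, None
    for net, asn in PREFIX_TO_ASN.items():
        if ip in net and (best_net is None or net.prefixlen > best_net.prefixlen):
            best_net, best_asn = net, asn
    return best_asn

def bytes_per_asn(records):
    """Sum bytes per ASN from (ip, byte_count) pairs; the IP is not retained."""
    totals = Counter()
    for addr, nbytes in records:
        totals[asn_for(addr)] += nbytes
    return totals

totals = bytes_per_asn([("192.0.2.5", 1000), ("192.0.2.200", 500), ("203.0.113.9", 250)])
print(dict(totals))
```

Only the per-ASN totals survive the aggregation step, so nothing address-level needs to be written to disk.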

oasisbob · on May 3, 2018

That's probably insufficient for the use case. A single AS can advertise many different routes for different IP blocks that have dramatic geographic differences.

merinowool · on April 30, 2018

How are you going to ask users for consent to process their IPs this way?

ryanworl · on April 30, 2018

Consent is not the only basis for legally processing data. There is not enough information in the above comment to determine which basis this company has determined their processing falls under.