
Probably that certain base sequences are more likely to occur together than they would by chance. If you account for this, it'll help your compression.
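
To make that concrete, here's a rough sketch comparing the entropy of a sequence when each base is treated independently (order-0) against the entropy when you condition on the previous base (order-1). The function names and the toy sequence are made up for illustration, and real genomes won't be anywhere near this predictable; the point is only that the gap between the two numbers is what a context-aware compressor can exploit.

```python
import math
from collections import Counter

def order0_entropy(seq):
    """Bits per base if every base is modeled independently."""
    counts = Counter(seq)
    n = len(seq)
    return -sum(c / n * math.log2(c / n) for c in counts.values())

def order1_entropy(seq):
    """Conditional entropy H(X_i | X_{i-1}) in bits per base."""
    pair_counts = Counter(zip(seq, seq[1:]))   # (previous, current) pairs
    prev_counts = Counter(seq[:-1])            # how often each base is a predecessor
    total = len(seq) - 1
    h = 0.0
    for (prev, cur), c in pair_counts.items():
        # weight by P(prev, cur), score by P(cur | prev)
        h -= c / total * math.log2(c / prev_counts[prev])
    return h

# Toy input (an assumption, not real data): perfectly periodic,
# so each base fully determines the next one.
toy = "ACGT" * 50
h0 = order0_entropy(toy)
h1 = order1_entropy(toy)
print(f"order-0: {h0:.3f} bits/base, order-1: {h1:.3f} bits/base")
```

On the toy string the four bases are equally frequent, so a frequency-only model can't do better than 2 bits per base, while the order-1 model needs essentially nothing.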



That's right - the Huffman tree was built from how frequently each base occurs, which actually varies depending on what type of organism you're sequencing[1].

[1] https://en.wikipedia.org/wiki/GC-content#Among-genome_variat...
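
For anyone curious what building a Huffman code from base frequencies looks like, here's a minimal sketch. It is not the code being discussed in this thread; `huffman_code` is a hypothetical helper and the counts are invented to mimic a skewed (AT-rich) composition.

```python
import heapq

def huffman_code(freqs):
    """Return {symbol: bitstring} for a dict of symbol -> weight."""
    # Heap entries are (weight, tie_breaker, {symbol: code_so_far}).
    # The unique tie_breaker keeps heapq from ever comparing the dicts.
    heap = [(w, i, {s: ""}) for i, (s, w) in enumerate(sorted(freqs.items()))]
    heapq.heapify(heap)
    if len(heap) == 1:
        return {s: "0" for s in freqs}
    next_id = len(heap)
    while len(heap) > 1:
        # Merge the two lightest subtrees, prefixing their codes with 0 / 1.
        w1, _, left = heapq.heappop(heap)
        w2, _, right = heapq.heappop(heap)
        merged = {s: "0" + c for s, c in left.items()}
        merged.update({s: "1" + c for s, c in right.items()})
        heapq.heappush(heap, (w1 + w2, next_id, merged))
        next_id += 1
    return heap[0][2]

# Invented counts per 100 bases, for illustration only.
skewed = {"A": 40, "T": 40, "C": 10, "G": 10}
uniform = {"A": 25, "T": 25, "C": 25, "G": 25}

skewed_code = huffman_code(skewed)
uniform_code = huffman_code(uniform)

def avg_bits(code, freqs):
    total = sum(freqs.values())
    return sum(freqs[s] * len(code[s]) for s in freqs) / total

print(skewed_code, avg_bits(skewed_code, skewed))
print(uniform_code, avg_bits(uniform_code, uniform))
```

With only four symbols, Huffman coding collapses to the plain 2-bits-per-base encoding when the bases are equally common, so any gain comes entirely from skew in the composition, which is why the organism matters.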



