A given combination of 7 bases has a probability of occurring of 1/16,384. Since the COVID genome is about 22k bases long I guess you have pretty good chance of it appearing in there somewhere. This assumes uniformity, which of course is not true. COVID’s genome is under crazy intense selection pressure!
Yep, the usual coding tools aren't ideal for bioinformatics. We have our own set of tools that work well with the various "standard" formats for sequence data.