occurrence and print

I am writing a program to go through a file and do some filtering (I attached the description in a pdf doc)

In summary:

  1. Command input:

python3 nucleotide_statistics_from_fasta.py –infile influenza.fasta –outfile influenz.stats.txt

2. Read the file

Save the header (accession) to header_list and save the sequence to sequence_list

3. Count the occurrence of each letter in each sequence (loop through all the sequence in the file)

4. Calculate GC-Content (for each sequence in the sequenc_list

GC-content = (numbers of G + number of C) / len(sequence)