occurrence and print
I am writing a program to go through a file and do some filtering (I attached the description in a pdf doc)
In summary:
- Command input:
python3 nucleotide_statistics_from_fasta.py –infile influenza.fasta –outfile influenz.stats.txt
2. Read the file
Save the header (accession) to header_list and save the sequence to sequence_list
3. Count the occurrence of each letter in each sequence (loop through all the sequence in the file)
4. Calculate GC-Content (for each sequence in the sequenc_list
GC-content = (numbers of G + number of C) / len(sequence)