Top "Bioinformatics" questions

For programming-related questions related to Bioinformatics.

Read FASTA into a dataframe and extract subsequences of FASTA file

I have a small fasta file of DNA sequences which looks like this: >NM_000016 700 200 234 ACATATTGGAGGCCGAAACAATGAGGCGTGATCAACTCAGTATATCAC >NM_000775 700 124 236 CTAACCTCTCCCAGTGTGGAACCTCTATCTCATGAGAAAGCTGGGATGAG >…

r subset bioinformatics fasta
How to call module written with argparse in iPython notebook

I am trying to pass BioPython sequences to Ilya Stepanov's implementation of Ukkonen's suffix tree algorithm in iPython's notebook environment. …

python ipython bioinformatics biopython suffix-tree
Dictionary style replace multiple items

I have a large data.frame of character data that I want to convert based on what is commonly called …

r dataframe bioinformatics
Organizing the output of my shell script into tables within the text file

I am working with a unix shell script that does genome construction then creates a phylogeny. Depending on the genome …

shell unix text-files bioinformatics genome
biopython no module named Bio

FYI: this is NOT a duplicate! Before running my python code I installed biopython in the cmd prompt: pip install …

python python-3.x pip bioinformatics biopython
Remove rows from dataframe that contains only 0 or just a single 0

I am trying to create a function in R that will allow me to filter my data set based on …

r filtering bioinformatics data-processing
Finding overlap in ranges with R

I have two data.frames each with three columns: chrom, start & stop, let's call them rangesA and rangesB. For …

r bioinformatics
Remove line breaks in a FASTA file

I have a fasta file where the sequences are broken up with newlines. I'd like to remove the newlines. Here's …

unix awk newline bioinformatics fasta
Undefined subroutines &main error in Perl

I am trying to extract a DNA sequence from this FASTA file to a specified length of bases per line, …

perl bioinformatics subroutine fasta
Pandas: .groupby().size() and percentages

I have a DataFrame that originates from a df.groupby().size() operation, and looks like this: Localization RNA level cytoplasm 1 …

python pandas bioinformatics