Top "Bioinformatics" questions

For programming questions related to bioinformatics.

Remove part of string after "."

I am working with NCBI Reference Sequence accession numbers like variable a: a <- c("NM_020506.1","NM_020519.1","NM_001030297.2","NM_010281.2","…

r regex string bioinformatics biomart
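The question above is tagged for R, but the idea is language-independent. Here is a minimal Python sketch, assuming the goal is to drop the version suffix (everything from the first "." onward) from each accession. The helper name `strip_version` is hypothetical, and only the four accessions visible in the excerpt are used.

```python
def strip_version(accession: str) -> str:
    """Return the text before the first '.', or the whole string if there is no '.'."""
    # str.partition splits at the first occurrence of the separator only.
    return accession.partition(".")[0]


# The four accessions visible in the question excerpt.
accessions = ["NM_020506.1", "NM_020519.1", "NM_001030297.2", "NM_010281.2"]
stripped = [strip_version(acc) for acc in accessions]
print(stripped)
```

Using `partition` rather than a regular expression keeps the sketch simple and leaves strings without a "." unchanged.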
WinError 2 The system cannot find the file specified (Python)

I have a Fortran program and want to execute it in Python for multiple files. I have 2000 input files but …

python python-2.7 python-3.x bioinformatics f2py
How much storage would be required to store a human genome?

I'm looking for the amount of storage in bytes (MB, GB, TB, etc.) required to store a single human genome. …

storage bioinformatics dna-sequence genetics
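A rough lower bound for the storage question can be worked out with simple arithmetic. The sketch below assumes a haploid genome of about 3.1 billion base pairs and a plain 2-bit encoding per base (A, C, G, T only). Both figures are assumptions for illustration. Real file formats also store ambiguity codes, quality scores, and metadata, so actual files can be far larger, or smaller when compressed.

```python
# Assumption: approximate haploid human genome length in base pairs.
GENOME_LENGTH_BP = 3_100_000_000
# Assumption: 2 bits per base covers A, C, G, T but not N or other ambiguity codes.
BITS_PER_BASE = 2


def packed_size_bytes(n_bases: int, bits_per_base: int = BITS_PER_BASE) -> int:
    """Bytes needed to store n_bases at a fixed number of bits per base, rounded up."""
    return (n_bases * bits_per_base + 7) // 8


size = packed_size_bytes(GENOME_LENGTH_BP)
print(f"{size / 1e6:.0f} MB")  # 775 MB under the assumptions above
```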
Reverse complement of DNA strand using Python

I have a DNA sequence and would like to get the reverse complement of it using Python. It is in one …

python list bioinformatics biopython dna-sequence
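One common way to compute a reverse complement in plain Python is a translation table followed by a slice reversal. The sketch below assumes the sequence is a string containing only A, C, G, and T (upper or lower case). The function name is illustrative, and any other character, such as N, passes through unchanged. Biopython's `Seq` objects also provide a `reverse_complement()` method if that library is available.

```python
# Translation table mapping each base to its complement, preserving case.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Complement every base, then reverse the result."""
    return seq.translate(COMPLEMENT)[::-1]


print(reverse_complement("AAAACCCGGT"))  # ACCGGGTTTT
```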
How to subtract strings in python

Basically, if I have a string 'AJ' and another string 'AJYF', I would like to be able to write 'AJYF'-'AJ' …

python python-3.x string bioinformatics
Count occurrences of given character per cell

For example, if I wanted to count the number of Ns in a column of strings, how can I …

google-sheets count character bioinformatics
Finding matching keys in two large dictionaries and doing it fast

I am trying to find corresponding keys in two different dictionaries. Each has about 600k entries. Say for example: myRDP = { …

python bioinformatics
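For finding keys shared by two large dictionaries, a set intersection on the key views is usually fast, since membership tests on dictionary keys are hash-based. The sketch below uses two small hypothetical dictionaries, `first` and `second`, as stand-ins. The contents of the asker's real dictionaries are not shown in the excerpt, so the keys and values here are invented for illustration.

```python
# Hypothetical stand-ins for the two large dictionaries.
first = {"seq1": "Bacteria", "seq2": "Archaea", "seq3": "Bacteria"}
second = {"seq2": 0.97, "seq3": 0.88, "seq4": 0.91}

# Dictionary key views support set operations directly.
common = first.keys() & second.keys()

# Pair up the values for every shared key, in sorted key order.
matched = {key: (first[key], second[key]) for key in sorted(common)}
print(matched)
```

This avoids a nested loop over both dictionaries, which would scale with the product of their sizes.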
How to remove rows with 0 values using R

Hi, I am using a gene expression matrix (frag counts) to calculate differentially expressed genes. I would like to know …

r bioinformatics
R extract part of string

I have a question about extracting a part of a string. For example I have a string like this: a &…

r regex string bioinformatics vcf-variant-call-format
How can I convert Ensembl ID to gene symbol in R?

I have a data.frame containing Ensembl IDs in one column; I would like to find corresponding gene symbols for …

r dataframe bioinformatics bioconductor