Forum:Problem with homer in findMotifs.pl when using input and bg fasta

Ilario

Junior Member

Join Date: Mar 2023
Posts: 1

Forum:Problem with homer in findMotifs.pl when using input and bg fasta

03-14-2023, 05:43 AM

I am using the following command:

Code:

findMotifs.pl input.fa fasta ./Output -fastaBg bg.fa -len 8,10,12 -norevopp

The input and bg fasta have this structure and the bg fasta include all sequences in input plus many others:

Code:

>ENSMUST00000027125_Coq10b_mmu_chr1_55071635_55072702_+_utr_55071803_55072702(+)
ATTTCTTTTGAATTCCGCTCCCTTCTGCACTCTCAGCTCGCTACTCTGTTCTTCGATGAAGTTGTGAAACAAATGGTAGC AGCCTTTGAAAGAAGAGCCTGTAAACTGTATGGTCCAGAGACAAACATACCTCGGGAATTAATGCTTCATGAAATTCACC ACACCTAAGAGGAAAATATTAGCTGCCTCCACCTACTCTTGGCTAGTTTGTTCACTTCTAGGAAGTCCTTTTACCATCTG` TTGAGAAGTCAGAAAGCATTTGTTAAACCTGCCTTGATTCTAAGCCCGTGCTGTTGAAAATTTGCACATTGAACATGGAC CCACTTGTACATAGAATTATTTCTTCAATCAAGTGTGACTCTAAGTATCATGTACATTTGCAGGCTCCGACCACCTTTGT AATAACGGATGTCATCACTGTTGCTAGGATACCACATTCCTCGTTTGAGTGTACAGATGAACAAGTCTTTTAATTCTCAC CTTACATGAAAAGGTTAGCTGAGATACAATGTGTGTTATATTAACCATATCATGTTTAAGTTATTAGGTTCAGAGTATTT GTAACTTATTGTTATTCGGCATGCCATATGGCTTAGGGTATTTGAATAATCATATATTTACCATTAAAACTGTGATTTAA AGTATTGCTAATGAAGTCTTAGCACTTTGGGTATTTTAATTGTTCTTATGGGTAGCAGTAGATGATTCAGTGTTGTTGGG

But I get the following error:

Code:

Selected Options:

Input file = /media/sequentia/synology_office2/Projects/554-Belloc/RIPseq/APAdetection/cpeb4_target.fa

Promoter Set = fasta

Output Directory = ./CPEB4

Will use FASTA files for motif finding

Target Sequences = /media/sequentia/synology_office2/Projects/554-Belloc/RIPseq/APAdetection/cpeb4_target.fa

Background Sequences = /media/sequentia/synology_office2/Projects/554-Belloc/RIPseq/APAdetection/most_used_cpeb4.fa

Motif length set at 8, 10, 12,

Will not search the reverse strand

Using custom gene IDs for GO analysis

Parsing FASTA format files...




Progress: Step4 - removing redundant promoters




Progress: Step5 - adjusting background sequences for GC/CpG content...




Sequences processed:

0 total




Frequency Bins: 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.6 0.7 0.8

Freq Bin Count

Illegal division by zero at /software/HOMER/bin/assignGeneWeights.pl line 63.




Normalizing lower order oligos using homer2




Reading input files...

0 total sequences read

Autonormalization: 1-mers (4 total)

A inf% inf% -nan

C inf% inf% -nan

G inf% inf% -nan

T inf% inf% -nan

Autonormalization: 2-mers (16 total)

AA inf% inf% -nan

CA inf% inf% -nan

GA inf% inf% -nan

TA inf% inf% -nan

AC inf% inf% -nan

CC inf% inf% -nan

GC inf% inf% -nan

TC inf% inf% -nan

AG inf% inf% -nan

CG inf% inf% -nan

GG inf% inf% -nan

TG inf% inf% -nan

AT inf% inf% -nan

CT inf% inf% -nan

GT inf% inf% -nan

TT inf% inf% -nan

Autonormalization: 3-mers (64 total)

Normalization weights can be found in file: ./CPEB4/seq.autonorm.tsv

Converging on autonormalization solution:

...............................................................................

Final normalization: Autonormalization: 1-mers (4 total)

A inf% inf% -nan

C inf% inf% -nan

(base) [B]idetoma@sequentia[/B]:[B]/media/sequentia/synology_office2/Projects/554-Belloc/RIPseq/motifs[/B]$ more nohup.out




Selected Options:

Input file = /media/sequentia/synology_office2/Projects/554-Belloc/RIPseq/APAdetection/cpeb4_target.fa

Promoter Set = fasta

Output Directory = ./CPEB4

Will use FASTA files for motif finding

Target Sequences = /media/sequentia/synology_office2/Projects/554-Belloc/RIPseq/APAdetection/cpeb4_target.fa

Background Sequences = /media/sequentia/synology_office2/Projects/554-Belloc/RIPseq/APAdetection/most_used_cpeb4.fa

Motif length set at 8, 10, 12,

Will not search the reverse strand

Using custom gene IDs for GO analysis

Parsing FASTA format files...




Progress: Step4 - removing redundant promoters




Progress: Step5 - adjusting background sequences for GC/CpG content...




Sequences processed:

0 total




Frequency Bins: 0.2 0.25 0.3 0.35 0.4 0.45 0.5 0.6 0.7 0.8

Freq Bin Count

Illegal division by zero at /software/HOMER/bin/assignGeneWeights.pl line 63.




Normalizing lower order oligos using homer2




Reading input files...

0 total sequences read

Autonormalization: 1-mers (4 total)

A inf% inf% -nan

C inf% inf% -nan

G inf% inf% -nan

T inf% inf% -nan

Autonormalization: 2-mers (16 total)

AA inf% inf% -nan

CA inf% inf% -nan

GA inf% inf% -nan

TA inf% inf% -nan

AC inf% inf% -nan

CC inf% inf% -nan

GC inf% inf% -nan

TC inf% inf% -nan

AG inf% inf% -nan

CG inf% inf% -nan

GG inf% inf% -nan

TG inf% inf% -nan

AT inf% inf% -nan

CT inf% inf% -nan

GT inf% inf% -nan

TT inf% inf% -nan

Autonormalization: 3-mers (64 total)

Normalization weights can be found in file: ./CPEB4/seq.autonorm.tsv

Converging on autonormalization solution:

...............................................................................

Final normalization: Autonormalization: 1-mers (4 total)

A inf% inf% -nan

C inf% inf% -nan

G inf% inf% -nan

T inf% inf% -nan

Autonormalization: 2-mers (16 total)

AA inf% inf% -nan

CA inf% inf% -nan

GA inf% inf% -nan

TA inf% inf% -nan

AC inf% inf% -nan

CC inf% inf% -nan

GC inf% inf% -nan

TC inf% inf% -nan

AG inf% inf% -nan

CG inf% inf% -nan

GG inf% inf% -nan

TG inf% inf% -nan

AT inf% inf% -nan

CT inf% inf% -nan

GT inf% inf% -nan

TT inf% inf% -nan

Autonormalization: 3-mers (64 total)




Progress: Step6 - Gene Ontology Enrichment Analysis

Skipping...




Progress: Step7 - Known motif enrichment




Reading input files...

0 total sequences read

1006 motifs loaded

Cache length = 11180

Using hypergeometric scoring

Checking enrichment of 1006 motif(s)

|0% 50% 100%|

=================================================================================

Illegal division by zero at /software/HOMER/bin/findKnownMotifs.pl line 152.




Progress: Step8 - De novo motif finding (HOMER)




Scanning input files...

!!! Something is wrong... are you sure you chose the right length for motif finding?

!!! i.e. also check your sequence file!!!




Scanning input files...

!!! Something is wrong... are you sure you chose the right length for motif finding?

!!! i.e. also check your sequence file!!!




-blen automatically set to 2

Scanning input files...

!!! Something is wrong... are you sure you chose the right length for motif finding?

!!! i.e. also check your sequence file!!!

Use of uninitialized value in numeric gt (>) at /software/HOMER/bin/compareMotifs.pl line 1394.

!!! Filtered out all motifs!!!

Job finished

What could be the problem?

Tags: homer, motif finding

Previous template Next

Recent Developments in Metagenomics

by seqadmin

Metagenomics has improved the way researchers study microorganisms across diverse environments. Historically, studying microorganisms relied on culturing them in the lab, a method that limits the investigation of many species since most are unculturable¹. Metagenomics overcomes these issues by allowing the study of microorganisms regardless of their ability to be cultured or the environments they inhabit. Over time, the field has evolved, especially with the advent...
- Channel: Articles
09-23-2024, 06:35 AM
Understanding Genetic Influence on Infectious Disease

by seqadmin

During the COVID-19 pandemic, scientists observed that while some individuals experienced severe illness when infected with SARS-CoV-2, others were barely affected. These disparities left researchers and clinicians wondering what causes the wide variations in response to viral infections and what role genetics plays.

Jean-Laurent Casanova, M.D., Ph.D., Professor at Rockefeller University, is a leading expert in this crossover between genetics and infectious...
- Channel: Articles
09-09-2024, 10:59 AM

Topics	Statistics	Last Post
Mechanical Forces in DNA Transcription Uncovered by Clemson Researchers by seqadmin Started by seqadmin, 10-02-2024, 04:51 AM	0 responses 13 views 0 likes	Last Post by seqadmin 10-02-2024, 04:51 AM
New Epigenetic Clock Links Cheek Cells to Mortality Risk by seqadmin Started by seqadmin, 10-01-2024, 07:10 AM	0 responses 21 views 0 likes	Last Post by seqadmin 10-01-2024, 07:10 AM
AI-Powered Blood Test Shows Promise for Early Ovarian Cancer Detection by seqadmin Started by seqadmin, 09-30-2024, 08:33 AM	0 responses 25 views 0 likes	Last Post by seqadmin 09-30-2024, 08:33 AM
Stem Cell Research Suggests Human Cells May Enter Developmental Pause by seqadmin Started by seqadmin, 09-26-2024, 12:57 PM	0 responses 18 views 0 likes	Last Post by seqadmin 09-26-2024, 12:57 PM

Seqanswers Leaderboard Ad

Announcement

Forum:Problem with homer in findMotifs.pl when using input and bg fasta

Latest Articles

ad_right_rmr

News