100searchengines.com
robots.txt

Robots Exclusion Standard data for 100searchengines.com

Resource Scan

Scan Details

Site Domain 100searchengines.com
Base Domain 100searchengines.com
Scan Status Ok
Last Scan 2024-06-08T23:34:17+00:00
Next Scan 2024-06-15T23:34:17+00:00

Last Scan

Scanned 2024-06-08T23:34:17+00:00
URL https://100searchengines.com/robots.txt
Redirect https://www.100searchengines.com/robots.txt
Redirect Domain www.100searchengines.com
Redirect Base 100searchengines.com
Domain IPs 50.28.77.191
Redirect IPs 50.28.77.191
Response IP 50.28.77.191
Found Yes
Hash 340b0c7d60b7b034f877e595ec1a765bf92f79d2381e751adb31832c6716aa86
SimHash 413859fdcc94

Groups

*

Rule Path
Disallow *TEXIS_ERROR*
Disallow *newsproset*
Disallow *trendscoset*
Disallow *autocompleteset*
Disallow *nightmode*
Disallow *safesearch*
Disallow *targetset*
Disallow *popcoset*
Disallow *gameselect*
Disallow *englishonlyset*
Disallow *mobilepopular*
Disallow *area%3Dgames*
Disallow *q%3D*
Disallow *dir5%3D*
Disallow *advanced*
Disallow /texis/
Disallow */texis/*
Disallow /mail/
Disallow /reverse-phone/
Disallow /free-internet-games/
Disallow /contact-us/
Disallow /cgi-bin/
Disallow /xml/
Disallow /?q=
Disallow q%3D
Disallow /q%3D
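
Several of the paths listed for the `*` group use `*` as a wildcard. As a rough illustration of how a crawler might evaluate such patterns, the sketch below turns each pattern into a regular expression anchored at the start of the URL path. The helper names (`robots_pattern_to_regex`, `is_disallowed`) and the test paths are hypothetical. The sketch ignores `Allow` precedence and does no percent-decoding, so `%3D` is compared literally. Under this prefix-matching assumption, a pattern such as `q%3D` that does not begin with `/` or `*` would not match any path.

```python
import re
from typing import Iterable


def robots_pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex anchored at the path start."""
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    # Escape every literal chunk; each "*" becomes ".*" (any run of characters).
    body = ".*".join(re.escape(chunk) for chunk in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored_end else ""))


def is_disallowed(path: str, disallow_patterns: Iterable[str]) -> bool:
    """Return True if any non-empty Disallow pattern matches the path."""
    return any(
        robots_pattern_to_regex(p).match(path) for p in disallow_patterns if p
    )


# A few patterns copied from the "*" group above.
RULES = ["*nightmode*", "/texis/", "/?q=", "q%3D"]

print(is_disallowed("/texis/search", RULES))      # prefix rule /texis/
print(is_disallowed("/page?nightmode=1", RULES))  # wildcard rule *nightmode*
print(is_disallowed("/about", RULES))             # no rule applies
```

A real crawler would also weigh `Allow` rules and pick the most specific match, which this sketch leaves out.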

adidxbot

Rule Path
Disallow

adsbot

Rule Path
Disallow

ahrefsbot

Rule Path
Disallow

amazonbot

Rule Path
Disallow

anthropic-ai

Rule Path
Disallow

applebot

Rule Path
Disallow

arquivo-web-crawler

Rule Path
Disallow

baiduspider

Rule Path
Disallow

barkrowler

Rule Path
Disallow

blexbot

Rule Path
Disallow

bytespider

Rule Path
Disallow

ccbot

Rule Path
Disallow

chatgpt-user

Rule Path
Disallow

claudebot

Rule Path
Disallow

claude-web

Rule Path
Disallow

cohere-ai

Rule Path
Disallow

dataforseobot

Rule Path
Disallow

diffbot

Rule Path
Disallow

dotbot

Rule Path
Disallow

facebookbot

Rule Path
Disallow

freefind

Rule Path
Disallow

google-extended

Rule Path
Disallow

gptbot

Rule Path
Disallow

grapeshot

Rule Path
Disallow

ia_archiver

Rule Path
Disallow

imagesiftbot

Rule Path
Disallow

ltx71 - (http://ltx71.com/)

Rule Path
Disallow

magpie-crawler

Rule Path
Disallow

mauibot

Rule Path
Disallow

mbcrawler

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow

mj12bot

Rule Path
Disallow

netseer

Rule Path
Disallow

obot

Rule Path
Disallow

omgili

Rule Path
Disallow

omgilibot

Rule Path
Disallow

openxadstxtcrawler

Rule Path
Disallow

perplexitybot

Rule Path
Disallow

pinterestbot

Rule Path
Disallow

proximic

Rule Path
Disallow

seekport crawler

Rule Path
Disallow

seekportbot

Rule Path
Disallow

semrushbot

Rule Path
Disallow

serpstatbot

Rule Path
Disallow

surdotlybot

Rule Path
Disallow

teoma

Rule Path
Disallow

tpradstxtcrawler

Rule Path
Disallow

twitterbot

Rule Path
Disallow

youbot

Rule Path
Disallow

zoominfobot

Rule Path
Allow /ads.txt
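
Most of the named groups above carry a `Disallow` rule with an empty path. Under the usual interpretation, an empty `Disallow` places no restriction on that crawler, so a named agent with such a group is not bound by the `*` group's rules. The sketch below checks this with Python's standard `urllib.robotparser`. The `SAMPLE` lines are a hypothetical excerpt written for illustration, not the site's actual file, and `otherbot` is a made-up agent name. Because `urllib.robotparser` does plain prefix matching, the sample uses only the non-wildcard `/cgi-bin/` rule.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical excerpt modelled on the groups in this report.
SAMPLE = [
    "User-agent: *",
    "Disallow: /cgi-bin/",
    "",
    "User-agent: gptbot",
    "Disallow:",
]

parser = RobotFileParser()
parser.parse(SAMPLE)  # parse in-memory lines; no network request is made

url = "https://www.100searchengines.com/cgi-bin/x"

# gptbot has its own group with an empty Disallow, so nothing is blocked.
print(parser.can_fetch("gptbot", url))

# An agent without its own group falls back to the "*" group.
print(parser.can_fetch("otherbot", url))
```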

Other Records

Field Value
sitemap https://www.100searchengines.com/sitemap.xml

Warnings

  • `noindex` is not a known field.