bigscal.com
robots.txt

Robots Exclusion Standard data for bigscal.com

Resource Scan

Scan Details

Site Domain bigscal.com
Base Domain bigscal.com
Scan Status Ok
Last Scan2024-09-30T07:28:40+00:00
Next Scan 2024-10-07T07:28:40+00:00

Last Scan

Scanned2024-09-30T07:28:40+00:00
URL https://bigscal.com/robots.txt
Domain IPs 104.26.0.27, 104.26.1.27, 172.67.73.1, 2606:4700:20::681a:11b, 2606:4700:20::681a:1b, 2606:4700:20::ac43:4901
Response IP 104.26.0.27
Found Yes
Hash 56676a414f283764dfaa034724c831ae47f3514052608df8ce5b3f48b52e7677
SimHash 781cffc07ac0

Groups

*

Rule Path
Allow /
Allow /wp-admin/admin-ajax.php
Disallow /wp-admin/
Disallow /page-image-checking
Disallow /author/*
Disallow /thank-you.html
Disallow /404.html
Disallow /blogs/tag/
Disallow *?taxonomy=*
Disallow *?wordfence_*
Disallow /.htaccess
Disallow /search/

Other Records

Field Value
crawl-delay 10

linkedinbot
bingbot
rsiteauditor
siteauditbot
semrushbot
semrushbot-sa
semrushbot-si
semrushbot-ba
semrushbot-swa
semrushbot-ct
splitsignalbot
semrushbot-coub
ahrefsbot
ahrefssiteaudit

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 60

rytebot
onpagebot
dotbot
rogerbot
mj12bot
seznambot
dataforseobot
petalbot
coccocbot-web
seokicks
awariorssbot
awariosmartbot
serendeputybot
mojeekbot
scrapy
orbbot
mauibot
neevabot
x09mozilla
x22mozilla

Rule Path
Disallow /

Other Records

Field Value
sitemap http://www.bigscal.com/sitemap_index.xml

Comments

  • Known crawlers
  • Blocked crawlers