cancertotob.com
robots.txt

Robots Exclusion Standard data for cancertotob.com

Resource Scan

Scan Details

Site Domain cancertotob.com
Base Domain cancertotob.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't establish SSL connection.
Last Scan2024-08-31T00:08:36+00:00
Next Scan 2024-09-30T00:08:36+00:00

Last Successful Scan

Scanned2024-07-10T00:07:19+00:00
URL https://cancertotob.com/robots.txt
Redirect https://www.childrenofalessergodbroadway.com/robots.txt
Redirect Domain www.childrenofalessergodbroadway.com
Redirect Base childrenofalessergodbroadway.com
Domain IPs 104.21.85.133, 172.67.206.48, 2606:4700:3030::6815:5585, 2606:4700:3036::ac43:ce30
Redirect IPs 104.21.40.143, 172.67.153.8, 2606:4700:3034::ac43:9908, 2606:4700:3035::6815:288f
Response IP 172.67.153.8
Found Yes
Hash 31eadaad343ffa1b2836781cd069ba8cfd6e27bcb4cd62efdabfa992dbf2ee89
SimHash 0913d335c7d1

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /tmp/
Allow /*.jpg$
Allow /*.jpeg$
Allow /*.gif$
Allow /*.png$
Allow /*.webp$

dmca.com page protection crawling service

Rule Path
Allow /

architextspider

Rule Path
Allow /

archive.org_bot

Rule Path
Allow /

applebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

exabot

Rule Path
Allow /

facebot

Rule Path
Allow /

feedfetcher-google

Rule Path
Allow /

googlebot

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

googlebot-video

Rule Path
Allow /

ia_archiver

Rule Path
Allow /

linkedinbot

Rule Path
Allow /

twitterbot

Rule Path
Allow /

screaming frog seo spider

Rule Path
Allow /

slurp

Rule Path
Allow /

msnbot

Rule Path
Allow /

yahoo pipes 1.0

Rule Path
Allow /

yahoo! slurp

Rule Path
Allow /

yandexbot

Rule Path
Allow /

yandeximages

Rule Path
Allow /

yandexnews

Rule Path
Allow /

yandexwebmaster

Rule Path
Allow /

yandexpagechecker

Rule Path
Allow /

yeti

Rule Path
Allow /

zyborg

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.childrenofalessergodbroadway.com/sitemap.xml

Warnings

  • 2 invalid lines.