sciencescafe.com
robots.txt

Robots Exclusion Standard data for sciencescafe.com

Resource Scan

Scan Details

Site Domain sciencescafe.com
Base Domain sciencescafe.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-10-03T03:19:12+00:00
Next Scan 2024-10-10T03:19:12+00:00

Last Successful Scan

Scanned2024-09-25T02:43:16+00:00
URL https://sciencescafe.com/robots.txt
Domain IPs 65.20.71.86
Response IP 65.20.71.86
Found Yes
Hash d716099f2d6e8dafaabeb77296eb7e22ff3a15cc8dbc35779e97ed9cf9929c59
SimHash 60014baef293

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /linkout/
Disallow /recommended/
Disallow /comments/feed/
Disallow /trackback/
Disallow /index.php
Disallow /xmlrpc.php

ninjabot

Rule Path
Allow /

mediapartners-google*

Rule Path
Allow /

googlebot-image

Rule Path
Allow /wp-content/uploads/

adsbot-google

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

Other Records

Field Value
sitemap https://sciencescafe.com/sitemap_index.xml