newsclapper.com
robots.txt

Robots Exclusion Standard data for newsclapper.com

Resource Scan

Scan Details

Site Domain newsclapper.com
Base Domain newsclapper.com
Scan Status Ok
Last Scan 2024-09-25T02:45:34+00:00
Next Scan 2024-10-25T02:45:34+00:00

Last Scan

Scanned 2024-09-25T02:45:34+00:00
URL https://newsclapper.com/robots.txt
Redirect https://www.clapperapp.com/robots.txt
Redirect Domain www.clapperapp.com
Redirect Base clapperapp.com
Domain IPs 108.158.32.107, 108.158.32.116, 108.158.32.31, 108.158.32.79
Redirect IPs 23.215.7.25, 23.215.7.4
Response IP 23.52.40.16
Found Yes
Hash 5ba819ed00d439d21feda22b333e5b9e4892600ee7096119975ef5e2ed05f89f
SimHash 4c16536025e1
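
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the report does not say which algorithm the scanner uses or exactly which bytes it hashes. As a minimal sketch under that assumption, a fetched robots.txt body could be compared against the recorded value as follows. The helper names `sha256_hex` and `matches_recorded` are hypothetical and not part of the scanner.

```python
import hashlib

# Hash value recorded in the scan details above.
RECORDED_HASH = "5ba819ed00d439d21feda22b333e5b9e4892600ee7096119975ef5e2ed05f89f"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Check a body against the recorded hash.

    Assumption: the scanner hashes the unmodified response bytes with SHA-256.
    If it normalizes line endings or uses another algorithm, this will not match.
    """
    return sha256_hex(body) == recorded.lower()
```

A mismatch would not by itself prove the file changed, since the hashing assumption may simply be wrong.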

Groups

googlebot
applebot
bingbot
duckduckbot
yeti
twitterbot
yandex

Rule Path
Allow /

baiduspider
yisouspider
sogouspider

Rule Path
Disallow /

*

Rule Path
Allow /about
Allow /terms
Allow /privacy
Allow /community
Allow /copyright
Allow /contact
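
The three groups above can be checked with Python's standard `urllib.robotparser`. The sketch below rebuilds a robots.txt from the listed groups only; the original file's exact text, ordering, and the invalid line noted under Warnings are not reproduced, and the agent name `examplebot` and the path `/video/1` are hypothetical test inputs.

```python
from urllib.robotparser import RobotFileParser

# Reconstruction from the groups listed in this report (an assumption,
# not the verbatim file served at the redirect target).
ROBOTS_TXT = """\
User-agent: googlebot
User-agent: applebot
User-agent: bingbot
User-agent: duckduckbot
User-agent: yeti
User-agent: twitterbot
User-agent: yandex
Allow: /

User-agent: baiduspider
User-agent: yisouspider
User-agent: sogouspider
Disallow: /

User-agent: *
Allow: /about
Allow: /terms
Allow: /privacy
Allow: /community
Allow: /copyright
Allow: /contact
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# (user agent, path) pairs to evaluate; the last two use a hypothetical agent.
checks = [
    ("googlebot", "/video/1"),
    ("baiduspider", "/about"),
    ("examplebot", "/about"),
    ("examplebot", "/video/1"),
]

for agent, path in checks:
    verdict = "allowed" if parser.can_fetch(agent, path) else "blocked"
    print(f"{agent:12} {path:10} {verdict}")
```

Under this reconstruction only the second check is blocked. Because the `*` group lists Allow rules and no Disallow rule, agents that fall through to it are not blocked from any path: a path matched by no rule is allowed by default.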

Warnings

  • 1 invalid line.