cf.groundtruth.com
robots.txt

Robots Exclusion Standard data for cf.groundtruth.com

Resource Scan

Scan Details

Site Domain cf.groundtruth.com
Base Domain groundtruth.com
Scan Status Ok
Last Scan2024-10-21T21:53:00+00:00
Next Scan 2024-11-20T21:53:00+00:00

Last Scan

Scanned2024-10-21T21:53:00+00:00
URL https://cf.groundtruth.com/robots.txt
Domain IPs 18.155.68.2, 18.155.68.34, 18.155.68.80, 18.155.68.95
Response IP 18.155.68.95
Found Yes
Hash 72d9ac9f5cb74324da86e38164f9352b723dd03ffc593df679d7ae7dd440343d
SimHash 8804d800cbf3

Groups

twitterbot

Rule Path
Disallow

*

Rule Path
Disallow /