sistacafe.com
robots.txt
Robots Exclusion Standard data for sistacafe.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | sistacafe.com |
Base Domain | sistacafe.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 4/7/2025, 11:40:59 PM |
Next Scan | 6/6/2025, 11:40:59 PM |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2/7/2025, 11:36:53 PM |
URL | https://sistacafe.com/robots.txt |
Domain IPs | 104.21.30.178, 172.67.173.126, 2606:4700:3035::6815:1eb2, 2606:4700:3037::ac43:ad7e |
Response IP | 104.21.30.178 |
Found | Yes |
Hash | 780731da1707d5f2fefc10bc3ee4b395e59f27fd2a5bde9f2d40bcc5198e3656 |
SimHash | 5b108f34ef52 |
Groups
User-agent: *
Rule | Path |
---|---|
Allow | / |
Allow | /sis-collabs |
Allow | /original-content |
Allow | /register |
Allow | /login |
Allow | /ranking/* |
Allow | /user/* |
Allow | /summaries/* |
Allow | /reviews/* |
Allow | /products/* |
Disallow | /c/* |
Disallow | /admin/* |
Disallow | /galleries/* |
Disallow | /scripts/truehits/* |
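Because the group opens with `Allow | /`, the outcome for any path depends on how a crawler resolves overlapping rules. The sketch below assumes the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie); individual crawlers may behave differently. The names `RULES`, `_pattern_to_regex`, and `is_allowed` are illustrative and do not come from the scan.

```python
import re

# Rules transcribed from the "*" group in the table above.
RULES = [
    ("allow", "/"),
    ("allow", "/sis-collabs"),
    ("allow", "/original-content"),
    ("allow", "/register"),
    ("allow", "/login"),
    ("allow", "/ranking/*"),
    ("allow", "/user/*"),
    ("allow", "/summaries/*"),
    ("allow", "/reviews/*"),
    ("allow", "/products/*"),
    ("disallow", "/c/*"),
    ("disallow", "/admin/*"),
    ("disallow", "/galleries/*"),
    ("disallow", "/scripts/truehits/*"),
]


def _pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Assumption: RFC 9309 semantics. The longest matching pattern wins,
    Allow wins ties, and a path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt requires.
        if _pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for sample in ["/", "/user/abc", "/c/123", "/admin/users", "/scripts/other.js"]:
        print(sample, "->", "allowed" if is_allowed(sample) else "blocked")
```

Under this assumption, `/c/123` is blocked because `Disallow: /c/*` is longer than `Allow: /`, while `/scripts/other.js` stays allowed because only `/scripts/truehits/*` is disallowed. A parser that applies the first matching rule instead would let `Allow: /` admit every path, so the precedence model matters for this file.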
Other Records
Field | Value |
---|---|
sitemap | https://sistacafe.com/sitemap.xml |
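The parsed records above can be turned back into an approximate robots.txt file. The following is a minimal sketch under stated assumptions: the names `GROUPS`, `OTHER_RECORDS`, and `render_robots_txt` are hypothetical, and the original file's comments, blank lines, and exact ordering are not recorded here, so the output is not expected to reproduce the hash listed in the scan.

```python
# Records transcribed from the "Groups" and "Other Records" tables.
GROUPS = {
    "*": [
        ("Allow", "/"),
        ("Allow", "/sis-collabs"),
        ("Allow", "/original-content"),
        ("Allow", "/register"),
        ("Allow", "/login"),
        ("Allow", "/ranking/*"),
        ("Allow", "/user/*"),
        ("Allow", "/summaries/*"),
        ("Allow", "/reviews/*"),
        ("Allow", "/products/*"),
        ("Disallow", "/c/*"),
        ("Disallow", "/admin/*"),
        ("Disallow", "/galleries/*"),
        ("Disallow", "/scripts/truehits/*"),
    ],
}
OTHER_RECORDS = [("Sitemap", "https://sistacafe.com/sitemap.xml")]


def render_robots_txt(groups, other_records):
    """Render parsed groups and extra records as robots.txt text."""
    lines = []
    for agent, rules in groups.items():
        lines.append(f"User-agent: {agent}")
        lines.extend(f"{rule}: {path}" for rule, path in rules)
        lines.append("")  # blank line separates a group from what follows
    lines.extend(f"{field}: {value}" for field, value in other_records)
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_robots_txt(GROUPS, OTHER_RECORDS), end="")
```

The rendered text starts with `User-agent: *`, lists the Allow and Disallow lines in table order, and ends with the `Sitemap` record.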