thriveread.com
robots.txt
Robots Exclusion Standard data for thriveread.com
Resource Scan
Scan Details
Site Domain | thriveread.com |
Base Domain | thriveread.com |
Scan Status | Ok |
Last Scan | 2024-11-14T02:18:52+00:00 |
Next Scan | 2024-11-21T02:18:52+00:00 |
Last Scan
Scanned | 2024-11-14T02:18:52+00:00 |
URL | https://thriveread.com/robots.txt |
Domain IPs | 104.21.57.222, 172.67.150.89, 2606:4700:3034::6815:39de, 2606:4700:3037::ac43:9659 |
Response IP | 172.67.150.89 |
Found | Yes |
Hash | 07be5bd85b25761e7ea6deebc2de3ee4507e6b3aa1f50f8fcb738becab38cf51 |
SimHash | c906495189d2 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Allow | /topic/*.js/ |
Disallow | /sign-up/ |
Disallow | /sign-up-routing/ |
Disallow | /feedback/tg20/ |
Disallow | /feedback/thanks/ |
Disallow | /feedback/flyer/ |
Disallow | /error/ |
Disallow | /ezoic/ |
Disallow | *.js |
Disallow | /ezais/ |
Disallow | /humix/ |
Disallow | /21732118914%2C23036119598/ |
Disallow | /video/ |
Disallow | /search/ |
Disallow | /Pages/ |
Disallow | /authors/rose-waitherero/ |
Disallow | /history |
Disallow | /IABid/ |
Disallow | /21732118914/ |
Other Records
Field | Value |
---|---|
sitemap | https://thriveread.com/sitemap.xml |
Warnings
- 4 invalid lines.