thriveread.com
robots.txt

Robots Exclusion Standard data for thriveread.com

Resource Scan

Scan Details

Site Domain thriveread.com
Base Domain thriveread.com
Scan Status Ok
Last Scan 2024-11-14T02:18:52+00:00
Next Scan 2024-11-21T02:18:52+00:00

Last Scan

Scanned 2024-11-14T02:18:52+00:00
URL https://thriveread.com/robots.txt
Domain IPs 104.21.57.222, 172.67.150.89, 2606:4700:3034::6815:39de, 2606:4700:3037::ac43:9659
Response IP 172.67.150.89
Found Yes
Hash 07be5bd85b25761e7ea6deebc2de3ee4507e6b3aa1f50f8fcb738becab38cf51
SimHash c906495189d2

Groups

*

Rule Path
Allow /
Allow /topic/*.js/
Disallow /sign-up/
Disallow /sign-up-routing/
Disallow /feedback/tg20/
Disallow /feedback/thanks/
Disallow /feedback/flyer/
Disallow /error/
Disallow /ezoic/
Disallow *.js
Disallow /ezais/
Disallow /humix/
Disallow /21732118914%2C23036119598/
Disallow /video/
Disallow /search/
Disallow /Pages/
Disallow /authors/rose-waitherero/
Disallow /history
Disallow /IABid/
Disallow /21732118914/
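
As a rough illustration of how a crawler might evaluate the rules listed above, the following is a minimal sketch that assumes RFC 9309 style matching: the longest matching pattern wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and are not part of the scan output. The sketch skips percent-encoding normalization and treats `*.js` as a valid pattern, although some parsers may ignore patterns that do not begin with `/`.

```python
import re

# Rules copied from the "*" group above, in order of appearance.
RULES = [
    ("allow", "/"),
    ("allow", "/topic/*.js/"),
    ("disallow", "/sign-up/"),
    ("disallow", "/sign-up-routing/"),
    ("disallow", "/feedback/tg20/"),
    ("disallow", "/feedback/thanks/"),
    ("disallow", "/feedback/flyer/"),
    ("disallow", "/error/"),
    ("disallow", "/ezoic/"),
    ("disallow", "*.js"),
    ("disallow", "/ezais/"),
    ("disallow", "/humix/"),
    ("disallow", "/21732118914%2C23036119598/"),
    ("disallow", "/video/"),
    ("disallow", "/search/"),
    ("disallow", "/Pages/"),
    ("disallow", "/authors/rose-waitherero/"),
    ("disallow", "/history"),
    ("disallow", "/IABid/"),
    ("disallow", "/21732118914/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex matched from the path start.

    '*' matches any run of characters; a trailing '$' anchors the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under the longest-match rule."""
    best_len = -1      # length of the most specific matching pattern so far
    best_allow = True  # with no matching rule, crawling is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            # Longer pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for p in ["/", "/search/", "/assets/app.js", "/topic/foo.js/", "/history-of-x"]:
        print(p, is_allowed(p))
```

Under these assumptions, `/topic/foo.js/` stays crawlable because `/topic/*.js/` is longer than `*.js`, while `/assets/app.js` is blocked by `*.js`. `/history` has no trailing slash, so it also blocks any path that merely begins with that prefix, such as `/history-of-x`.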

Other Records

Field Value
sitemap https://thriveread.com/sitemap.xml

Warnings

  • 4 invalid lines.