readeat.com
robots.txt

Robots Exclusion Standard data for readeat.com

Resource Scan

Scan Details

Site Domain readeat.com
Base Domain readeat.com
Scan Status Ok
Last Scan2024-05-04T23:12:48+00:00
Next Scan 2024-06-03T23:12:48+00:00

Last Scan

Scanned2024-05-04T23:12:48+00:00
URL https://readeat.com/robots.txt
Domain IPs 104.21.83.146, 172.67.177.114, 2606:4700:3034::6815:5392, 2606:4700:3035::ac43:b172
Response IP 104.21.83.146
Found Yes
Hash d0ee94220f82738b029ee18894c768aa8e12f545ee12744239687c7bd24ac0f6
SimHash c07442724fb0

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /?
Disallow *?s=
Disallow *?status=
Disallow *%26s%3D
Disallow /search
Disallow *?keyword=
Disallow *?letter=
Disallow */feed
Disallow */rss
Disallow */embed
Disallow /xmlrpc.php
Disallow /cart
Disallow /checkout
Disallow /order
Disallow /account/
Allow */uploads
Allow /*.js*
Allow /*.css*
Allow /*.png*
Allow /*.jpg*
Allow /*.jpeg*
Allow /*.gif*
Allow /*.svg*
Allow /*.webp*

sebot-wa

Rule Path
Disallow /

adbot/1.0

Rule Path
Disallow /

dataforseobot/1.0

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

salesdoubler_feed_bot

Rule Path
Disallow /

wellknownbot/0.1

Rule Path
Disallow /

sebot-wa

Rule Path
Disallow /

Other Records

Field Value
sitemap https://readeat.com/sitemap.xml