openrice.com
robots.txt

Robots Exclusion Standard data for openrice.com

Resource Scan

Scan Details

Site Domain openrice.com
Base Domain openrice.com
Scan Status Ok
Last Scan 2024-11-09T21:21:24+00:00
Next Scan 2024-11-16T21:21:24+00:00

Last Scan

Scanned 2024-11-09T21:21:24+00:00
URL https://openrice.com/robots.txt
Redirect https://www.openrice.com/robots.txt
Redirect Domain www.openrice.com
Redirect Base openrice.com
Domain IPs 170.33.8.214
Redirect IPs 170.33.8.214
Response IP 170.33.8.214
Found Yes
Hash a1f517dde153513eb449b18f9c925d728f1d4e1c2cc4e03204a6524ced734fc1
SimHash a0d15543a792
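
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest of the fetched file. The report does not state the algorithm, so the following is only a sketch under that assumption; `robots_fingerprint` is a hypothetical helper name, and whether the scanner normalizes the body before hashing is unknown.

```python
import hashlib


def robots_fingerprint(body: bytes) -> str:
    """Return a hex digest of a robots.txt body.

    Assumption: SHA-256 over the raw response bytes. The scan report
    does not confirm the algorithm or any normalization step.
    """
    return hashlib.sha256(body).hexdigest()


# Illustrative body only, not the real openrice.com file.
sample = b"User-agent: *\nDisallow: /service/\n"
digest = robots_fingerprint(sample)
print(len(digest))  # a SHA-256 hex digest is always 64 characters long
```

Comparing digests between two scans is a cheap way to detect that the file changed, though it says nothing about how much it changed.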

Groups

*

Rule Path
Disallow /service/
Disallow /service2/
Disallow /webservice/
Disallow /*/restaurant/report.htm
Disallow /*/restaurant/report/
Disallow /*/reports/
Disallow /*/restaurant/mapreport.htm
Disallow /*/restaurant/write.htm
Disallow /*/review/write
Disallow /*/restaurant/similar.htm
Disallow /myopenrice/addbookrestaurant.htm
Disallow /*/restaurant/EmailFriendmode.htm
Disallow /*/restaurant/flagreview.htm
Disallow /*/restaurant/comment.htm
Disallow /*/restaurant/apicomments.htm
Disallow /*/restaurant/dbsoffer.htm
Disallow /*/restaurant/recipe.htm
Disallow /*/restaurant/userinfo.htm
Disallow /*/gourmet/reviews.htm
Disallow /*/gourmet/photos.htm
Disallow /*/gourmet/videos.htm
Disallow /*/gourmet/bookmarkrestaurant.htm
Disallow /*/gourmet/bookmarkuser.htm
Disallow /*/gourmet/bookmarkcoupon.htm
Disallow /*/gourmet/bookmarkreview.htm
Disallow /stat/
Disallow /stats/
Disallow /big5/
Disallow /info/ptvapp/
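
Many of the rules above use `*` as a wildcard, which Python's standard `urllib.robotparser` does not interpret. The sketch below shows one way to test a URL path against a few of the listed Disallow patterns, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end of the path, and every rule otherwise matches as a prefix. The helper names and the sample paths (such as `/en/...`) are hypothetical and used only for illustration.

```python
import re

# A subset of the Disallow rules listed for the "*" group.
DISALLOW_RULES = [
    "/service/",
    "/*/restaurant/report.htm",
    "/*/review/write",
    "/big5/",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumed semantics: "*" matches any sequence of characters, and a
    trailing "$" anchors the match at the end of the path.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape literal segments and join them with ".*" for each wildcard.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, rules=DISALLOW_RULES) -> bool:
    """Return True if any rule matches the path from its first character."""
    return any(rule_to_regex(rule).match(path) for rule in rules)


for candidate in (
    "/service/ping",
    "/en/restaurant/report.htm",
    "/en/restaurant/menu.htm",
):
    print(candidate, is_disallowed(candidate))
```

Because this group contains only Disallow rules, the sketch skips the longest-match precedence that would be needed if Allow rules were also present.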