therealreal.com
robots.txt

Robots Exclusion Standard data for therealreal.com

Resource Scan

Scan Details

Site Domain therealreal.com
Base Domain therealreal.com
Scan Status Ok
Last Scan2024-10-31T18:47:17+00:00
Next Scan 2024-11-07T18:47:17+00:00

Last Scan

Scanned2024-10-31T18:47:17+00:00
URL https://therealreal.com/robots.txt
Redirect https://www.therealreal.com/robots.txt
Redirect Domain www.therealreal.com
Redirect Base therealreal.com
Domain IPs 151.101.64.242
Redirect IPs 151.101.0.242, 151.101.128.242, 151.101.192.242, 151.101.64.242
Response IP 199.232.44.242
Found Yes
Hash 62515428c384d2034bd3fb10d09288592b3ceb53121377d0dfabf13378363bf6
SimHash a8e50b0cfcf1

Groups

*

Rule Path
Disallow /cart
Disallow /checkouts
Disallow /orders
Disallow /countries
Disallow /login
Disallow /line_items
Disallow /password_resets
Disallow /states
Disallow /user_sessions
Disallow /users
Disallow /admin
Disallow /phoenix*
Disallow /consign/*
Disallow /consignments/*
Disallow /ev56mY37/xhr/*

Other Records

Field Value
sitemap https://assets.therealreal.com/sitemaps/sitemap_index.xml

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file.