papier.com
robots.txt

Robots Exclusion Standard data for papier.com

Resource Scan

Scan Details

Site Domain papier.com
Base Domain papier.com
Scan Status Ok
Last Scan2024-06-21T11:10:15+00:00
Next Scan 2024-07-21T11:10:15+00:00

Last Scan

Scanned2024-06-21T11:10:15+00:00
URL https://papier.com/robots.txt
Redirect https://www.papier.com/robots.txt
Redirect Domain www.papier.com
Redirect Base papier.com
Domain IPs 13.227.254.19, 13.227.254.32, 13.227.254.81, 13.227.254.96
Redirect IPs 13.248.194.187, 76.223.81.112
Response IP 76.223.81.112
Found Yes
Hash 22a215c3afa0cc72674e1c713c38209c94d3d1ee5647616cc9bd2a4e35b64c6e
SimHash 1a840f856574

Groups

*

Rule Path
Disallow /search?
Disallow /*/search
Disallow /customise
Disallow /*/customise
Disallow /saved_designs
Disallow /*/saved_designs
Disallow /fr/thefold
Disallow /de/thefold
Disallow /thefold/more-articles
Disallow /*/thefold/more-articles
Disallow /*thanks$
Disallow /*basket$
Disallow /*checkout$
Disallow /manage

Other Records

Field Value
sitemap https://www.papier.com/sitemap.xml

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-agent: *
  • Disallow: /
  • Temporary restrictions