pacegallery.com
robots.txt

Robots Exclusion Standard data for pacegallery.com

Resource Scan

Scan Details

Site Domain pacegallery.com
Base Domain pacegallery.com
Scan Status Ok
Last Scan2024-09-08T11:15:18+00:00
Next Scan 2024-10-08T11:15:18+00:00

Last Scan

Scanned2024-09-08T11:15:18+00:00
URL https://www.pacegallery.com/robots.txt
Domain IPs 2600:9000:2003:2600:9:df6c:b4c0:93a1, 2600:9000:2003:3a00:9:df6c:b4c0:93a1, 2600:9000:2003:4e00:9:df6c:b4c0:93a1, 2600:9000:2003:9000:9:df6c:b4c0:93a1, 2600:9000:2003:a800:9:df6c:b4c0:93a1, 2600:9000:2003:d000:9:df6c:b4c0:93a1, 2600:9000:2003:f800:9:df6c:b4c0:93a1, 2600:9000:2003:fe00:9:df6c:b4c0:93a1, 52.84.229.106, 52.84.229.129, 52.84.229.72, 52.84.229.98
Response IP 52.84.229.129
Found Yes
Hash 3edce9ee1c17c9ae91a2033407d12dae1139945c64eaf2bfd0a1495ba809f23d
SimHash a81c3fc5eff2

Groups

*

Rule Path
Disallow *.pdf$
Disallow /landing_google
Disallow /enterprise/landing_pages/google*
Disallow /private-showing/
Disallow /p/
Disallow /_util/*

Other Records

Field Value
sitemap https://www.pacegallery.com/sitemap.xml

Comments

  • See http://www.robotstxt.org for documentation on how to use the robots.txt file