advent.com
robots.txt

Robots Exclusion Standard data for advent.com

Resource Scan

Scan Details

Site Domain advent.com
Base Domain advent.com
Scan Status Ok
Last Scan2025-08-21T15:34:04+00:00
Next Scan 2025-09-20T15:34:04+00:00

Last Scan

Scanned2025-08-21T15:34:04+00:00
URL https://advent.com/robots.txt
Redirect https://www.advent.com/robots.txt
Redirect Domain www.advent.com
Redirect Base advent.com
Domain IPs 104.19.191.28, 104.19.208.28
Redirect IPs 162.159.140.127, 172.66.0.125, 2606:4700:7::7d, 2a06:98c1:58::7d
Response IP 172.66.0.125
Found Yes
Hash 973b5fb09ef5d0c42ef824200e15f9f1a5e90db2d931c28b56c5df95094481d5
SimHash 3a1379437db0

Groups

*

Rule Path
Disallow /bin/
Disallow /config/
Disallow /umbraco/
Disallow /views/
Disallow /scripts/
Disallow /css/
Disallow /media/
Disallow /assets/
Disallow *.pdf

Other Records

Field Value
sitemap https://www.advent.com/sitemap/

Comments

  • To add a comment to the file, start the line with the # character.
  • User-Agent is used to target a particular web crawler.
  • Any rules declared below it will apply to that User-Agent.
  • To hide a file or folder from the User-Agent, type the word 'Disallow' followed by a semi-colon.

Warnings

  • 1 invalid line.