digitalgregg.com
robots.txt

Robots Exclusion Standard data for digitalgregg.com

Resource Scan

Scan Details

Site Domain digitalgregg.com
Base Domain digitalgregg.com
Scan Status Ok
Last Scan2025-10-30T16:42:51+00:00
Next Scan 2025-11-29T16:42:51+00:00

Last Scan

Scanned2025-10-30T16:42:51+00:00
URL https://digitalgregg.com/robots.txt
Redirect https://www.digitalgregg.com/robots.txt
Redirect Domain www.digitalgregg.com
Redirect Base digitalgregg.com
Domain IPs 104.21.22.80, 172.67.203.119, 2606:4700:3031::ac43:cb77, 2606:4700:3033::6815:1650
Redirect IPs 104.21.22.80, 172.67.203.119, 2606:4700:3031::ac43:cb77, 2606:4700:3033::6815:1650
Response IP 172.67.203.119
Found Yes
Hash 69223a4f890a4f12c9b1e162c4c27e9ad0e0f655a5464a4929b711b48d7003f5
SimHash 4840c90347f3

Groups

*

Rule Path
Disallow /accounts
Disallow /welcome/
Disallow *.pdf
Disallow /*.pdf$
Disallow /*.pdf
Disallow *.pdf$

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.digitalgregg.com/sitemap.xml

Comments

  • Block all crawlers for /accounts
  • Allow all crawlers