qasgx.com
robots.txt

Robots Exclusion Standard data for qasgx.com

Resource Scan

Scan Details

Site Domain qasgx.com
Base Domain qasgx.com
Scan Status Ok
Last Scan2024-11-05T12:53:30+00:00
Next Scan 2024-11-19T12:53:30+00:00

Last Scan

Scanned2024-11-05T12:53:30+00:00
URL https://qasgx.com/robots.txt
Redirect https://www.qasgx.com/robots.txt
Redirect Domain www.qasgx.com
Redirect Base qasgx.com
Domain IPs 23.209.46.147, 23.209.46.148, 2600:1413:b000:1c::17d1:2ee0, 2600:1413:b000:1c::17d1:2ee3
Redirect IPs 23.32.29.9, 2600:1413:b000:1b::17d7:705, 2600:1413:b000:1b::17d7:707, 96.17.180.48
Response IP 104.88.71.83
Found Yes
Hash e9ec023094d490bb261760eb2a235529bf416dc7928c8538e861705fce7cb896
SimHash aa18d71347f0

Groups

*

Rule Path
Disallow /

Comments

  • SGX.com V2 robots.txt
  • Tested against https://www.google.com/webmasters/tools/robots-testing-tool
  • Last Update: 9 December 2019
  • Dev and QA entry to Disallow bots and crawlers
  • Below this line is used in Production
  • User-agent: Twitterbot
  • Disallow: /
  • User-agent: Mediapartners-Google
  • Disallow: /
  • User-agent: *
  • Allow: /
  • Disallow: /assets/static/e-learning/