virtusa.com
robots.txt

Robots Exclusion Standard data for virtusa.com

Resource Scan

Scan Details

Site Domain virtusa.com
Base Domain virtusa.com
Scan Status Ok
Last Scan 2024-09-16T16:40:32+00:00
Next Scan 2024-10-16T16:40:32+00:00

Last Scan

Scanned 2024-09-16T16:40:32+00:00
URL https://virtusa.com/robots.txt
Redirect https://www.virtusa.com/robots.txt
Redirect Domain www.virtusa.com
Redirect Base virtusa.com
Domain IPs 20.10.90.192
Redirect IPs 96.17.96.14, 96.17.96.23
Response IP 23.44.4.162
Found Yes
Hash 260b537950fa2d5ba5138e0e9f9634fb1a2e031f78926ed6561a00cb56191ccd
SimHash 81040c90e3b2

Groups

*

Rule Path
Allow /
Disallow /content/virtusa/language-masters/
Disallow /content/virtusa/en/forms/
Disallow /content/virtusa/de/forms/
Disallow /forms/
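
The rule table above mixes a blanket Allow with narrower Disallow entries, so the outcome for a given URL depends on how a crawler resolves overlapping matches. The sketch below assumes the longest-match precedence described in RFC 9309 (the most specific matching path wins, and Allow wins a tie). It uses plain prefix matching only and ignores the `*` and `$` wildcards, which none of the listed paths use. The names `RULES` and `is_allowed` are illustrative and are not part of the scan data.

```python
# Rules for the "*" group, copied from the table above.
RULES = [
    ("allow", "/"),
    ("disallow", "/content/virtusa/language-masters/"),
    ("disallow", "/content/virtusa/en/forms/"),
    ("disallow", "/content/virtusa/de/forms/"),
    ("disallow", "/forms/"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Assumption: simple prefix matching, no wildcard support.
    """
    best_len = -1
    allowed = True  # a path that matches no rule is allowed by default
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ("/about", "/forms/contact", "/content/virtusa/en/forms/x"):
        print(p, is_allowed(p))
```

Under this interpretation, `Allow /` only decides paths that no Disallow entry matches. A parser that applies rules in file order and stops at the first match could reach a different result if `Allow /` is listed first.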

Other Records

Field Value
sitemap https://www.virtusa.com/content/virtusa/en/repositories.sitemap.xml
sitemap https://www.virtusa.com/content/virtusa/de/repositories.sitemap.xml
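
Sitemap records sit outside any user-agent group, so they can be collected with a single pass over the file. The following is a minimal sketch of that pass. The `SAMPLE` text is a reconstruction assumed from the fields listed in this report, not the verbatim file, and `sitemap_urls` is a hypothetical helper name.

```python
def sitemap_urls(robots_txt: str) -> list[str]:
    """Collect the values of all `Sitemap:` records, in file order."""
    urls = []
    for raw in robots_txt.splitlines():
        # Drop trailing comments. Assumption: sitemap URLs carry no "#" fragment.
        line = raw.split("#", 1)[0].strip()
        if ":" not in line:
            continue
        # Split on the first colon only, since the URL itself contains colons.
        field, value = line.split(":", 1)
        if field.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls


# Assumed reconstruction of the relevant records, for illustration only.
SAMPLE = """\
User-agent: *
Allow: /
Disallow: /forms/

Sitemap: https://www.virtusa.com/content/virtusa/en/repositories.sitemap.xml
Sitemap: https://www.virtusa.com/content/virtusa/de/repositories.sitemap.xml
"""

if __name__ == "__main__":
    for url in sitemap_urls(SAMPLE):
        print(url)
```

The field name is compared case-insensitively because robots.txt field names are not case-sensitive.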

Comments

  • Any search crawler can crawl our site
  • Allow only below mentioned paths
  • Disallow everything else
  • Crawl all sitemaps mentioned below