corp-sansan.com
robots.txt

Robots Exclusion Standard data for corp-sansan.com

Resource Scan

Scan Details

Site Domain corp-sansan.com
Base Domain corp-sansan.com
Scan Status Ok
Last Scan 2024-05-29T14:00:16+00:00
Next Scan 2024-06-28T14:00:16+00:00

Last Scan

Scanned 2024-05-29T14:00:16+00:00
URL https://corp-sansan.com/robots.txt
Redirect https://www.corp-sansan.com/robots.txt
Redirect Domain www.corp-sansan.com
Redirect Base corp-sansan.com
Domain IPs 104.18.0.136, 104.18.1.136
Redirect IPs 104.18.0.136, 104.18.1.136
Response IP 104.18.1.136
Found Yes
Hash 8e5949707d61e99290285654d0b42e25081bf024b8d923725d73fa6355545b47
SimHash 6908cd600b92

Groups

*

Rule Path
Disallow /corp/wp-admin/
Allow /corp/wp-admin/admin-ajax.php
Disallow /corp/wp-content/uploads/media-from-ftp-tmp/
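
The Allow rule above carves an exception out of the broader /corp/wp-admin/ Disallow. A minimal sketch of how a crawler might evaluate these three rules, assuming the longest-match precedence described in RFC 9309 (the most specific matching path wins, and Allow wins a tie). The names RULES and is_allowed are hypothetical, and wildcard handling (* and $) is left out because none of the rules listed here use it.

```python
# Rules copied from the "*" group in this report, as (rule, path) pairs.
RULES = [
    ("disallow", "/corp/wp-admin/"),
    ("allow", "/corp/wp-admin/admin-ajax.php"),
    ("disallow", "/corp/wp-content/uploads/media-from-ftp-tmp/"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: plain prefix matching with longest-match precedence;
    a path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ("/corp/wp-admin/options.php",
              "/corp/wp-admin/admin-ajax.php",
              "/corp/about/"):
        print(p, is_allowed(p))
```

Parsers that apply rules in file order rather than by longest match may treat admin-ajax.php as blocked here, since the Disallow line comes first.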

Other Records

Field Value
sitemap https://www.corp-sansan.com/sitemap.xml
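
The Groups and Other Records tables above could be derived from the raw file roughly as follows. This is a sketch under assumptions: ROBOTS_TXT is a reconstruction of the file from the values in this report, not the fetched original, and parse_robots is a hypothetical helper rather than the scanner's actual code.

```python
# Assumed reconstruction of the scanned file, built from the tables above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /corp/wp-admin/
Allow: /corp/wp-admin/admin-ajax.php
Disallow: /corp/wp-content/uploads/media-from-ftp-tmp/

Sitemap: https://www.corp-sansan.com/sitemap.xml
"""


def parse_robots(text):
    """Split robots.txt text into rule groups and other records."""
    groups = {}       # user-agent token -> list of (rule, path)
    other = []        # records outside groups, e.g. sitemap
    current = []      # user-agents of the group currently being read
    in_rules = False  # True once the current group has at least one rule
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if not line or ":" not in line:
            continue
        field, value = line.split(":", 1)
        field, value = field.strip().lower(), value.strip()
        if field == "user-agent":
            if in_rules:  # a user-agent after rules starts a new group
                current, in_rules = [], False
            current.append(value)
            groups.setdefault(value, [])
        elif field in ("allow", "disallow"):
            in_rules = True
            for agent in current:
                groups[agent].append((field, value))
        else:
            other.append((field, value))
    return groups, other


if __name__ == "__main__":
    groups, other = parse_robots(ROBOTS_TXT)
    print(groups)
    print(other)
```

Splitting on the first colon only keeps the https:// in the sitemap URL intact.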