media.pearsoncmg.com
robots.txt

Robots Exclusion Standard data for media.pearsoncmg.com

Resource Scan

Scan Details

Site Domain media.pearsoncmg.com
Base Domain pearsoncmg.com
Scan Status Ok
Last Scan 2024-10-22T19:48:31+00:00
Next Scan 2024-11-21T19:48:31+00:00

Last Scan

Scanned 2024-10-22T19:48:31+00:00
URL https://media.pearsoncmg.com/robots.txt
Domain IPs 23.210.109.212
Response IP 23.210.109.212
Found Yes
Hash 407da5bfa8092872323642b1e4c874144fdcc2675bc2883c4212339cf2443387
SimHash e854da624b10

Groups

gsa-crawler

Rule Path
Disallow (empty)

wps_site_robot

Rule Path
Disallow (empty)

*

Rule Path
Disallow /
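
The three groups above can be read as follows: a Disallow rule with no path blocks nothing, so gsa-crawler and wps_site_robot may fetch everything, while every other crawler falls under the * group and is blocked from the whole site. Below is a minimal sketch of that behavior using Python's standard urllib.robotparser. The robots.txt text is reconstructed from the listed groups, so its exact wording, ordering, and whitespace are assumptions, and ExampleBot and the sample URL path are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed in the scan. The real file's exact
# text, ordering, and whitespace are assumptions.
ROBOTS_TXT = """\
User-agent: gsa-crawler
Disallow:

User-agent: wps_site_robot
Disallow:

User-agent: *
Disallow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt content held in memory (no network request)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Hypothetical URL on the scanned host, used only to exercise the rules.
SAMPLE_URL = "https://media.pearsoncmg.com/example/path"

# "ExampleBot" is a made-up crawler name that matches only the * group.
results = {
    agent: parser.can_fetch(agent, SAMPLE_URL)
    for agent in ("gsa-crawler", "wps_site_robot", "ExampleBot")
}

for agent, allowed in results.items():
    print(f"{agent}: {'allowed' if allowed else 'blocked'}")
```

Under these assumptions the two named crawlers are reported as allowed and the unnamed one as blocked, which matches the usual reading of an empty Disallow versus Disallow /.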