wral.com
robots.txt

Robots Exclusion Standard data for wral.com

Resource Scan

Scan Details

Site Domain wral.com
Base Domain wral.com
Scan Status Ok
Last Scan2024-05-10T16:33:30+00:00
Next Scan 2024-05-17T16:33:30+00:00

Last Scan

Scanned2024-05-10T16:33:30+00:00
URL https://wral.com/robots.txt
Redirect https://www.wral.com/robots.txt
Redirect Domain www.wral.com
Redirect Base wral.com
Domain IPs 18.154.132.121, 18.154.132.50, 18.154.132.76, 18.154.132.98, 2600:9000:234c:1c00:18:336d:ed40:93a1, 2600:9000:234c:2400:18:336d:ed40:93a1, 2600:9000:234c:8200:18:336d:ed40:93a1, 2600:9000:234c:9800:18:336d:ed40:93a1, 2600:9000:234c:9e00:18:336d:ed40:93a1, 2600:9000:234c:d400:18:336d:ed40:93a1, 2600:9000:234c:da00:18:336d:ed40:93a1, 2600:9000:234c:de00:18:336d:ed40:93a1
Redirect IPs 18.161.6.105, 18.161.6.106, 18.161.6.32, 18.161.6.65, 2600:9000:24db:1000:18:336d:ed40:93a1, 2600:9000:24db:1200:18:336d:ed40:93a1, 2600:9000:24db:4600:18:336d:ed40:93a1, 2600:9000:24db:6400:18:336d:ed40:93a1, 2600:9000:24db:7e00:18:336d:ed40:93a1, 2600:9000:24db:a200:18:336d:ed40:93a1, 2600:9000:24db:ac00:18:336d:ed40:93a1, 2600:9000:24db:ce00:18:336d:ed40:93a1
Response IP 18.165.171.120
Found Yes
Hash 6b6bac37cda85d824a113e997bc4739e9cee579d70c9617d231cd5c417b4b3a0
SimHash 8d37580db5d4

Groups

grapeshot
ia_archiver
bingbot
bing
facebot
facebookexternalhit
googlebot
google
mediapartners-google
slurp
twitterbot

Rule Path
Disallow /apps/
Disallow /*?print_friendly
Disallow /golo/page/1896337/
Disallow /rs/page/2173068/
Disallow /weather/page/1010362/?id=*
Disallow /weather/?wc_img=*
Disallow /arrest-photos/8981628/
Disallow /sports/
Disallow /search/
Disallow /content/creative_services/promos/clickthru?*
Disallow /suggest-a-correction/

Other Records

Field Value
sitemap http://www.wral.com/sitemap_index.xml

Comments

  • (www.)wral.com robots.txt