languagehumanities.org
robots.txt

Robots Exclusion Standard data for languagehumanities.org

Resource Scan

Scan Details

Site Domain languagehumanities.org
Base Domain languagehumanities.org
Scan Status Ok
Last Scan 2024-09-26T00:28:37+00:00
Next Scan 2024-10-03T00:28:37+00:00
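
The Last Scan and Next Scan timestamps above are exactly seven days apart, which suggests a weekly rescan. The report does not state the schedule, so the seven-day interval in this minimal Python sketch is an inference from those two values, and the names `RESCAN_INTERVAL` and `next_scan` are hypothetical.

```python
from datetime import datetime, timedelta

# Assumption: a fixed 7-day interval, inferred from the two timestamps in the report.
RESCAN_INTERVAL = timedelta(days=7)

def next_scan(last_scan_iso: str) -> str:
    """Return the ISO 8601 timestamp one rescan interval after last_scan_iso."""
    last_scan = datetime.fromisoformat(last_scan_iso)
    return (last_scan + RESCAN_INTERVAL).isoformat()

print(next_scan("2024-09-26T00:28:37+00:00"))
```

With the Last Scan value from this report, the sketch prints the Next Scan value listed above.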

Last Scan

Scanned 2024-09-26T00:28:37+00:00
URL https://languagehumanities.org/robots.txt
Redirect https://www.languagehumanities.org/robots.txt
Redirect Domain www.languagehumanities.org
Redirect Base languagehumanities.org
Domain IPs 52.52.207.191, 52.9.164.247
Redirect IPs 108.157.254.108, 108.157.254.50, 108.157.254.81, 108.157.254.93, 2600:9000:2753:1200:9:2198:cb00:93a1, 2600:9000:2753:600:9:2198:cb00:93a1, 2600:9000:2753:7c00:9:2198:cb00:93a1, 2600:9000:2753:8800:9:2198:cb00:93a1, 2600:9000:2753:9800:9:2198:cb00:93a1, 2600:9000:2753:a600:9:2198:cb00:93a1, 2600:9000:2753:d600:9:2198:cb00:93a1, 2600:9000:2753:da00:9:2198:cb00:93a1
Response IP 108.157.254.108
Found Yes
Hash 013e69127d3c192831841c24f77a7662e9605f62578fac8ab7ed4a26c897db86
SimHash 8b00d73e7313
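
The Hash value above is 64 hexadecimal digits long, which is the length of a SHA-256 digest. The report does not name the algorithm or say exactly which bytes were hashed, so the following Python sketch only assumes SHA-256 over the raw response body to illustrate how a scanner might detect that a robots.txt file has changed between scans. It is not expected to reproduce the value listed here, and the SimHash field is not covered. The function names are hypothetical.

```python
import hashlib

def body_fingerprint(body: bytes) -> str:
    """Hex digest of the raw robots.txt body (assumption: SHA-256)."""
    return hashlib.sha256(body).hexdigest()

def has_changed(previous_hex: str, body: bytes) -> bool:
    """True when the body no longer matches the digest stored from the last scan."""
    return body_fingerprint(body) != previous_hex

# Illustrative bodies only; these are not the file served by the site.
old_body = b"User-agent: *\nDisallow: /s/\n"
new_body = b"User-agent: *\nDisallow: /s/\nDisallow: /d/\n"

stored = body_fingerprint(old_body)
print(has_changed(stored, old_body))
print(has_changed(stored, new_body))
```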

Groups

*

Rule Path
Disallow /s/
Disallow /templates/
Disallow /d/
Disallow /related/
Disallow /relevant/
Disallow /videos/
Disallow /captcha.php
Disallow /*?expand_article
Disallow /*.js?cb=
Disallow /quizzes*

mediapartners-google

Rule Path
Allow /s/
Allow /related/
Allow /relevant/
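
The two groups above can be evaluated with a small matcher. The Python sketch below transcribes the listed rules and applies longest-match semantics in the style of RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, Allow wins a tie, and a crawler that has its own group ignores the `*` group. The user-agent lookup is simplified to an exact lowercase match, and the test paths and the crawler name `examplebot` are made up for illustration. Real crawlers may differ in detail.

```python
import re

# Rules transcribed from the scan report; everything else is illustrative.
GROUPS = {
    "*": [
        ("disallow", "/s/"),
        ("disallow", "/templates/"),
        ("disallow", "/d/"),
        ("disallow", "/related/"),
        ("disallow", "/relevant/"),
        ("disallow", "/videos/"),
        ("disallow", "/captcha.php"),
        ("disallow", "/*?expand_article"),
        ("disallow", "/*.js?cb="),
        ("disallow", "/quizzes*"),
    ],
    "mediapartners-google": [
        ("allow", "/s/"),
        ("allow", "/related/"),
        ("allow", "/relevant/"),
    ],
}

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the path start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(user_agent: str, path: str) -> bool:
    """Longest matching rule wins; Allow wins ties; no match means allowed."""
    # Simplification: exact lowercase lookup, falling back to the '*' group.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len, verdict = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, verdict = len(pattern), allow
    return verdict

print(is_allowed("examplebot", "/s/some-page"))            # blocked by /s/
print(is_allowed("mediapartners-google", "/s/some-page"))  # allowed by its own group
print(is_allowed("examplebot", "/some-article.htm"))       # no rule matches
```

Because the `mediapartners-google` group contains only Allow rules and replaces the `*` group for that crawler, every path is permitted for it under this reading, including paths such as `/videos/` that are blocked for everyone else.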

Other Records

Field Value
sitemap https://www.languagehumanities.org/sitemap-languagehumanities.org-index.xml