usagif.com
robots.txt

Robots Exclusion Standard data for usagif.com

Resource Scan

Scan Details

Site Domain usagif.com
Base Domain usagif.com
Scan Status Ok
Last Scan2024-09-26T15:57:20+00:00
Next Scan 2024-10-03T15:57:20+00:00

Last Scan

Scanned2024-09-26T15:57:20+00:00
URL https://usagif.com/robots.txt
Domain IPs 104.21.76.211, 172.67.201.61, 2606:4700:3034::6815:4cd3, 2606:4700:3037::ac43:c93d
Response IP 172.67.201.61
Found Yes
Hash 49f99b6cd7e9f1cab9c9fd6c43391f4021a9254f82b0f5e3c62a099b3cecbe65
SimHash 4b808a63d923

Groups

*

Rule Path
Allow /wp-content/uploads/*
Disallow /wp-admin/
Disallow /search.html
Disallow /search.html?*
Allow /wp-admin/admin-ajax.php
Disallow /*/search/*
Disallow /search/*
Disallow /gif/*

yandex

Rule Path
Disallow /ja/*
Disallow /ar/*
Disallow /es/*
Disallow /sv/*
Disallow /da/*
Disallow /nl/*

Other Records

Field Value
sitemap https://usagif.com/sitemap_index.xml

Warnings

  • `clean-param` is not a known field.