usagif.com
robots.txt

Robots Exclusion Standard data for usagif.com

Resource Scan

Scan Details

Site Domain usagif.com
Base Domain usagif.com
Scan Status Ok
Last Scan2024-06-13T06:19:45+00:00
Next Scan 2024-06-20T06:19:45+00:00

Last Scan

Scanned2024-06-13T06:19:45+00:00
URL https://usagif.com/robots.txt
Domain IPs 104.21.76.211, 172.67.201.61, 2606:4700:3034::6815:4cd3, 2606:4700:3037::ac43:c93d
Response IP 104.21.76.211
Found Yes
Hash 0b48ccade746bbec3282b674b40b4b145595eb40222a8f90dffabebda7d5aebe
SimHash 4b808a63eb23

Groups

*

Rule Path
Allow /wp-content/uploads/*
Disallow /wp-admin/
Disallow /search.html
Disallow /search.html?*
Allow /wp-admin/admin-ajax.php
Disallow /*/search/*
Disallow /search/*

yandex

Rule Path
Disallow /ja/*
Disallow /ar/*
Disallow /es/*
Disallow /sv/*
Disallow /da/*
Disallow /nl/*

Other Records

Field Value
sitemap https://usagif.com/sitemap_index.xml

Warnings

  • `clean-param` is not a known field.