howlearngerman.com
robots.txt

Robots Exclusion Standard data for howlearngerman.com

Resource Scan

Scan Details

Site Domain howlearngerman.com
Base Domain howlearngerman.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-05-31T22:12:10+00:00
Next Scan 2024-08-29T22:12:10+00:00

Last Successful Scan

Scanned 2023-11-04T14:50:55+00:00
URL https://howlearngerman.com/robots.txt
Domain IPs 192.0.78.180, 192.0.78.208
Response IP 192.0.78.180
Found Yes
Hash f7e2c3413c5229692f915e56192d2c7a2eed5727f36e2c86bcbe16b082bd23f1
SimHash 218c4ae0d899

Groups

*

Rule Path
Allow /wp-content/uploads/
Allow /wp-content/themes/
Allow /*/*.js
Allow /*/*.css
Allow /wp-*.png
Allow /wp-*.jpg
Allow /wp-*.jpeg
Allow /wp-*.gif
Allow /wp-*.svg
Allow /wp-*.pdf

*

Rule Path
Disallow /wp-content/uploads/wpo-plugins-tables-list.json
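To illustrate how the Allow and Disallow rules in the two `*` groups interact, here is a minimal sketch in Python. It assumes the longest-match semantics described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie), with `*` matching any run of characters and a trailing `$` anchoring the end of the path. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and individual crawlers may evaluate rules differently.

```python
import re

# Rules from the two "*" groups listed above, merged because both
# groups apply to every user agent.
RULES = [
    ("allow", "/wp-content/uploads/"),
    ("allow", "/wp-content/themes/"),
    ("allow", "/*/*.js"),
    ("allow", "/*/*.css"),
    ("allow", "/wp-*.png"),
    ("allow", "/wp-*.jpg"),
    ("allow", "/wp-*.jpeg"),
    ("allow", "/wp-*.gif"),
    ("allow", "/wp-*.svg"),
    ("allow", "/wp-*.pdf"),
    ("disallow", "/wp-content/uploads/wpo-plugins-tables-list.json"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    # A trailing '$' anchors the pattern to the end of the path.
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # '*' matches any run of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under the assumed semantics."""
    best_len, best_allow = -1, True  # no matching rule means allowed
    for verdict, pattern in rules:
        # re.match anchors at the start, so patterns act as path prefixes.
        if pattern_to_regex(pattern).match(path):
            allow = verdict == "allow"
            # Longest pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow
```

Under these assumptions, `is_allowed("/wp-content/uploads/wpo-plugins-tables-list.json")` returns `False`, because the Disallow pattern is longer than the `/wp-content/uploads/` Allow pattern that also matches. Other files under `/wp-content/uploads/` remain allowed, as does any path that no rule matches.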

Other Records

Field Value
sitemap https://howlearngerman.com/sitemap.xml