thewom.it
robots.txt

Robots Exclusion Standard data for thewom.it

Resource Scan

Scan Details

Site Domain thewom.it
Base Domain thewom.it
Scan Status Ok
Last Scan2024-09-19T18:20:21+00:00
Next Scan 2024-09-26T18:20:21+00:00

Last Scan

Scanned2024-09-19T18:20:21+00:00
URL https://thewom.it/robots.txt
Redirect https://www.thewom.it/robots.txt
Redirect Domain www.thewom.it
Redirect Base thewom.it
Domain IPs 176.34.239.234, 34.248.24.67
Redirect IPs 2600:1413:b000:6::17d5:2bc9, 2600:1413:b000:6::17d5:2bce, 96.17.96.12, 96.17.96.32
Response IP 23.44.4.136
Found Yes
Hash 1013a05cbe0986c3aaa6f4677dfddd3e453cd6421124daba0f4ebe1c6940d1b6
SimHash 4154dd42e591

Groups

*

Rule Path
Allow /

gptbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

seekr

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.thewom.it/sitemap_index.xml
sitemap https://www.thewom.it/news-sitemap.xml