utagawavtt.com
robots.txt

Robots Exclusion Standard data for utagawavtt.com

Resource Scan

Scan Details

Site Domain utagawavtt.com
Base Domain utagawavtt.com
Scan Status Ok
Last Scan 2025-03-07T14:41:33+00:00
Next Scan 2025-03-14T14:41:33+00:00

Last Scan

Scanned 2025-03-07T14:41:33+00:00
URL https://utagawavtt.com/robots.txt
Redirect https://www.utagawavtt.com/robots.txt
Redirect Domain www.utagawavtt.com
Redirect Base utagawavtt.com
Domain IPs 2001:4b98:dc0:43:f816:3eff:feaa:6ac2, 92.243.25.133
Redirect IPs 2001:4b98:dc0:43:f816:3eff:feaa:6ac2, 92.243.25.133
Response IP 92.243.25.133
Found Yes
Hash 4da3204ff8f146c6f885bbcde5016e26f5b55cae71f3fbd4af2a0c36070da95d
SimHash 401e256e43fb

Groups

*

Rule Path
Disallow /eyeblaster
Disallow /addineyeV2.html
Disallow /forum_v3/post-
Disallow /forum_v3/updates-topic
Disallow /forum_v3/stop-updates-topic
Disallow /forum_v3/index.php?
Disallow /forum_v3/error.php
Disallow /forum_v3/-br
Disallow /forum_v3/mark
Disallow /forum_v3/image-
Disallow /forum_v3/1-
Disallow /forum_v3/posting.php?
Disallow /forum_v3/groupcp.php
Disallow /forum_v3/profile.php?
Disallow /forum_v3/memberlist.php
Disallow /forum_v3/faq.php
Disallow /forum_v3/posting.php
Disallow /forum_v3/groupcp.php
Disallow /forum_v3/search.php
Disallow /forum_v3/login.php
Disallow /forum_v3/privmsg.php
Disallow /forum_v3/membre
Disallow /forum_v3/ucp.php
Disallow /forum_v3/style.php
Disallow /templates
Disallow /scripts
Disallow /adm
Disallow /article
Disallow /api
Disallow /css
Disallow /newsletter
Disallow /print
Disallow /cse-context.xml
Disallow /BingSiteAuth.xml
Disallow /GOOGLEe0546c569c60b87c.html
Disallow /T0wEq12pS7iAlI_XJqIu6nOfH0I.html
Disallow /oneall_redirect.php
Disallow /newsletter
Disallow /pinterest-784dc.html
Disallow /sniply-57134c5ad5c218133058f3c5.html
Disallow /*.json$
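
As a rough illustration of how the plain path rules above behave, here is a minimal Python sketch using the standard library's `urllib.robotparser`. It assumes the live file writes each rule as a `Disallow: <path>` line under `User-agent: *`. Only three of the listed rules are copied, and the `/admin/` and `/topos/` test paths are hypothetical. This parser does simple prefix matching and does not interpret `*` or `$`, so the `/*.json$` rule is left out.

```python
from urllib.robotparser import RobotFileParser

# A small subset of the "*" group, written the way a robots.txt file
# would normally spell it (assumed formatting, not copied from the live file).
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /forum_v3/login.php",
    "Disallow: /adm",
    "Disallow: /api",
]

BASE = "https://www.utagawavtt.com"

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)


def allowed(path: str) -> bool:
    """Return True if a generic crawler may fetch the given path."""
    return parser.can_fetch("*", BASE + path)


if __name__ == "__main__":
    # "/admin/" and "/topos/" are hypothetical paths used only for illustration.
    for path in ("/forum_v3/login.php", "/admin/", "/topos/"):
        print(path, "allowed" if allowed(path) else "blocked")
```

Because the rules are prefixes, `Disallow: /adm` also covers any path that merely starts with `/adm`, such as the hypothetical `/admin/`.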

Other Records

Field Value
sitemap https://www.utuagawavtt.com/sitemap.xml
sitemap https://www.utagawavtt.com/forum_v3/sitemap.xml

Comments

  • Sitemap definitions
  • eyeblaster (AdSense ads)
  • phpBB forum
  • UtagawaVTT back office
  • Block files ending in .json
  • The asterisk allows any file name
  • The dollar sign ensures the rule matches only at the end of a URL and not an oddly formatted URL (e.g. /locations.json.html)
  • Blocking AI bots
  • Blocks ChatGPT crawling
  • User-Agent: GPTBot
  • Disallow: /
  • User-Agent: ChatGPT-User
  • Disallow: /
  • Blocking Google AI (Bard and Vertex AI generative APIs)
  • User-agent: Google-Extended
  • Disallow: /
  • Blocking commoncrawl (CCBot)
  • User-agent: CCBot
  • Disallow: /
  • Speech synthesis only?
  • User-agent: FacebookBot
  • Disallow: /
  • Multi-purpose, commercial uses; including LLMs
  • User-agent: Omgilibot
  • Disallow: /
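
The comments about the asterisk and the dollar sign can be made concrete with a short Python sketch. It is one possible way to interpret such a rule, not the matcher any particular crawler uses: `*` is treated as "any run of characters" and a trailing `$` as "end of URL". The helper names `rule_to_regex` and `is_blocked` are hypothetical, and `/data/report.json` is a made-up path.

```python
import re


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path rule with '*' and a trailing '$' into a regex."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape the literal pieces, then join them with '.*' where '*' appeared.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    # \Z pins the match to the very end of the path when the rule ends in '$'.
    return re.compile(pattern + (r"\Z" if anchored else ""))


def is_blocked(rule: str, path: str) -> bool:
    """Return True if the path matches the rule, starting from the first character."""
    return rule_to_regex(rule).match(path) is not None


if __name__ == "__main__":
    rule = "/*.json$"
    for path in ("/locations.json", "/data/report.json", "/locations.json.html"):
        print(path, "blocked" if is_blocked(rule, path) else "not blocked")
```

Under this reading, dropping the `$` would turn the rule into a prefix-style pattern that also matches `/locations.json.html`, which is the case the comment warns about.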