help4all.pbworks.com
robots.txt

Robots Exclusion Standard data for help4all.pbworks.com

Resource Scan

Scan Details

Site Domain help4all.pbworks.com
Base Domain pbworks.com
Scan Status Ok
Last Scan 2024-09-12T06:17:07+00:00
Next Scan 2024-10-12T06:17:07+00:00

Last Scan

Scanned 2024-09-12T06:17:07+00:00
URL https://help4all.pbworks.com/robots.txt
Domain IPs 208.96.18.237, 208.96.18.238
Response IP 208.96.18.238
Found Yes
Hash cb9cbb2d7631e062b585b0e85504bca0a2ceed6ac71ee2727f53d2ad892409cd
SimHash 6349da8bebb0

Groups

googlebot

Rule Path
Disallow /*?
Disallow /*%5C.
Allow /f/*
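
The googlebot rules above use the `*` wildcard, which Python's standard `urllib.robotparser` treats as a literal character. The following is a minimal sketch of how such patterns could be evaluated, assuming the precedence Google documents for its own crawler (the longest matching pattern wins, and Allow wins a tie). The names `GOOGLEBOT_RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and are not part of the scanned data.

```python
import re

# Rules copied from the googlebot group above.
GOOGLEBOT_RULES = [
    ("disallow", "/*?"),
    ("disallow", "/*%5C."),
    ("allow", "/f/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex matched from the path start.

    '*' matches any run of characters; a trailing '$' anchors the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for the wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=GOOGLEBOT_RULES):
    """Return True if `path` may be fetched under `rules`.

    Assumption: longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow
```

Under these assumptions, `is_allowed("/FrontPage?rev=1")` is False because of `/*?`, while `is_allowed("/f/file.pdf?x=1")` is True because `/f/*` is the longer matching pattern.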

*

Rule Path
Disallow /session/
Disallow /settings/
Disallow /browse/
Disallow /w/browse/
Disallow /layout/
Disallow /rss.xml
Disallow /report.php
Disallow /w/newpage
Disallow /tags.php
Disallow /request_access.php
Disallow /theme_css.php
Disallow /view_img.php
Disallow /upgrade.php
Disallow /2-easy-ways
Disallow /RecentActivity
Disallow /WhatWikiIs
Disallow /changelist.php
Disallow /changes/
Disallow /changes.php
Disallow /AllPages
Disallow /FindPage
Disallow /admin.php
Disallow /help.php
Disallow /report.php
Disallow /contact.php

Other Records

Field Value
crawl-delay 600
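
As an illustration of how a crawler might read the `*` group and its crawl-delay record, here is a short sketch using Python's standard `urllib.robotparser`. It assumes the crawl-delay record belongs to the `*` group, as the layout of this report suggests. Only a few of the Disallow rules are repeated, and the agent name `examplebot` is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated copy of the "*" group above (assumption: crawl-delay is part of it).
ROBOTS_TXT = """\
User-agent: *
Disallow: /session/
Disallow: /settings/
Disallow: /rss.xml
Disallow: /admin.php
Crawl-delay: 600
"""

BASE = "https://help4all.pbworks.com"
AGENT = "examplebot"  # hypothetical crawler name, matched only by the "*" group

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def may_fetch(path):
    """Return True if AGENT may fetch `path` under the rules above."""
    return parser.can_fetch(AGENT, BASE + path)


# Seconds to wait between requests, as declared by the site.
delay = parser.crawl_delay(AGENT)

# Upper bound on requests per day if the delay is honored strictly.
max_fetches_per_day = 86400 // delay
```

With a delay of 600 seconds, a crawler that honors the record strictly makes at most 144 requests per day to this host.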

mj12bot

Rule Path
Disallow /

Other Records

Field Value
sitemap http://help4all.pbworks.com/sitemap.xml
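
The mj12bot group and the sitemap record can be read the same way. The sketch below again uses `urllib.robotparser` from the Python standard library (`site_maps()` requires Python 3.8 or later). The `*` group is reduced to a single rule for brevity, and `examplebot` is a hypothetical agent name.

```python
from urllib.robotparser import RobotFileParser

# Reduced reconstruction: one rule from the "*" group, the full mj12bot group,
# and the sitemap record exactly as reported above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /session/

User-agent: mj12bot
Disallow: /

Sitemap: http://help4all.pbworks.com/sitemap.xml
"""

BASE = "https://help4all.pbworks.com"

robots = RobotFileParser()
robots.parse(ROBOTS_TXT.splitlines())

# mj12bot is blocked from the whole site; other agents fall back to the "*" group.
mj12_ok = robots.can_fetch("MJ12bot", BASE + "/FrontPage")
other_ok = robots.can_fetch("examplebot", BASE + "/FrontPage")

# Sitemap URLs declared in the file, or None if there are none.
sitemaps = robots.site_maps()
```

Agent names are compared case-insensitively by the parser, so `MJ12bot` matches the `mj12bot` group.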

Comments

  • Public, indexed site.