
Google Search Appliance: Administrative API Developer’s Guide: Java 24
Crawl Errors:
Crawl Exclusions:
Errors Retrieval Error
7 Redirect without a location header
11 Document not found (404)
12 Other HTTP 400 errors
14 HTTP 0 error
15 Permanent DNS failure
16 Empty document
17 Image conversion failed
22 Authentication failed
25 Conversion error
32 HTTP 500 error
33 The robots.txt file is unreachable
35 Temporary DNS failure
36 Connection failed
37 Connection timeout
38 Connection closed
40 Connection refused
41 Connection reset
43 No route to host
50 Other error
Excluded Description
3 Not in the URLs to crawl
4 In the URLs to not crawl
5 Off domain redirect
6 Long redirect chain
8 Infinite URL space
9 Unhandled protocol
10 URL is too long
13 The robots.txt file indicates to not index
18 Rejected by rewrite rules
19 Unknown extension
20 Disallowed by a meta tag
24 Disallowed by the robots.txt file
Comentários a estes Manuais