Forecheck feature and function list

This is a list of all features, collected data and reports. It is not complete, but we try hard…

List of features

Very fast crawler with up to 1,000 parallel requests.

Programmed completely in Unicode

Supports all languages and all encodings (charsets): For displaying all chars, you may have to choose a font that can display the language since most fonts do not support all languages. Common encodings besides the popular UTF-8 are listed at Wikipedia.

Converts all chars into pure Unicode chars: All HTML Entities are converted into pure Unicode chars. Only chars that are defined as images cannot be converted. Content and keyword related functions use the converted unicode strings.

Analyze any domain you want, whenever and as often as you want.

Gathers SEO-related data (see the list of collected data below).

Find errors and problems on your website and remove them (see the list of reports below).

Full text search within all data: You can even search within the source code of all pages (all source codes can be downloaded).

Has many SEO-related analyses (see the list of reports below).

Duplicate title / description / content / Hx analysis based on 7 rules (see the list of reports below).

Canonical link URL evaluation in all duplicate reports.

The following features are available for most languages, but not for all. Please see the Help within Forecheck for more information:
Keyword stemming
Stop word detection
Language detection based on the content.
These features are used automatically for several content-related functions.

Content analysis including density and relevance.

Keyword extraction from content (stop words are deleted).
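
As a rough illustration of keyword extraction with stop word removal and a simple density value, here is a minimal Python sketch. It is not Forecheck's implementation: the stop word list, the function name and the density definition (occurrences divided by all word tokens) are assumptions chosen for the example.

```python
import re
from collections import Counter

# Hypothetical stop word list for illustration only; real lists are language-specific.
STOP_WORDS = {"the", "a", "an", "and", "of", "is", "in", "to"}

def extract_keywords(text, stop_words=STOP_WORDS):
    """Return (keyword counts, density per keyword) for the given text."""
    # Lowercase the text and split it into word tokens (\w is Unicode-aware).
    tokens = re.findall(r"\w+", text.lower())
    # Drop stop words before counting.
    keywords = [t for t in tokens if t not in stop_words]
    counts = Counter(keywords)
    total = len(tokens)
    # Assumed definition: occurrences divided by all tokens; other definitions exist.
    density = {word: n / total for word, n in counts.items()} if total else {}
    return counts, density

counts, density = extract_keywords(
    "The crawler checks the title and the crawler checks links"
)
print(counts.most_common(2), density["crawler"])
```

Stemming, which the list above also mentions, would be applied to the tokens before counting; it is left out here to keep the sketch short.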

Scheduler
Queue (combined with scheduler).

XML Sitemap functions
Import sitemap and check it (check only URL or also crawl the URLs in that sitemap).
Export analyzed data to an XML sitemap.
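
For readers unfamiliar with the sitemap format, the following Python sketch writes a minimal XML sitemap and reads the URLs back. It only illustrates the sitemaps.org format itself; the function names are hypothetical and nothing here reflects how Forecheck imports or exports sitemaps.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"

def build_sitemap(urls):
    """Build a minimal sitemap XML string from an iterable of URLs."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        entry = ET.SubElement(urlset, "url")
        # ElementTree escapes special chars such as & when serializing.
        ET.SubElement(entry, "loc").text = url
    return ET.tostring(urlset, encoding="unicode")

def read_sitemap(xml_text):
    """Return all <loc> values found in a sitemap XML string."""
    root = ET.fromstring(xml_text)
    return [loc.text for loc in root.iter("{%s}loc" % SITEMAP_NS)]

xml_text = build_sitemap(["https://example.com/", "https://example.com/a?x=1&y=2"])
print(read_sitemap(xml_text))
```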

Folder function: Crawls all files in a local or network folder.

URL List analysis: Add your individual list of URLs and analyze them, for example all your backlink URLs.

Analyze mobile redirect: Check if mobile redirects are implemented correctly, see Help on how to proceed.

Export all collected data (as listed below) and all reports to CSV or Excel (which keeps the data highlighted).

Load time: Measures the load time and rates it against real-time reference values.

Soft 404 warnings/errors: Define your own Soft 404 errors with the text search function that analyzes the source code during an analysis (there is another text search function that searches within all collected data).
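
The idea behind a soft 404 check can be sketched in a few lines of Python: a page that returns status 200 but whose source code contains a typical error phrase is flagged. The function name and the phrases are hypothetical examples, not Forecheck settings.

```python
def is_soft_404(status_code, source_code, patterns):
    """Return True if the page answers 200 but its source contains an error phrase."""
    if status_code != 200:
        return False  # a real error status is not a *soft* 404
    lowered = source_code.lower()
    # Case-insensitive substring search for any of the user-defined phrases.
    return any(p.lower() in lowered for p in patterns)

# Hypothetical phrases a user might define for their own site.
patterns = ["page not found", "product is no longer available"]

print(is_soft_404(200, "<h1>Page not found</h1>", patterns))
print(is_soft_404(200, "<h1>Welcome</h1>", patterns))
```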

Search for missing product images in a shop: You can use the soft 404 search function in the search feature in the collected data to detect missing product images in a shop. See the Help on how to proceed.

Analyze tracking code: Check if your tracking code is implemented correctly. See Help for more information.

Cookie handling: Enable or disable Cookies or define your individual Cookie.

User-Agent: Choose your individual User-Agent (3 lists available for browsers, crawlers and mobile or define your individual User-Agent).

Language Switch: Add any Accept-Language Header you like.

Include or exclude folders of a domain.

Delete parameters during an analysis (test parameter settings within Google Webmaster Tools).

Add parameter to all URLs during an analysis.

Check language and browser switches

Set individual http/https handling (treat them as different or the same URLs).

Punycode/IDN support: Decode all URLs during an analysis or keep them as is.
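
To show what decoding a Punycode host name means, here is a small Python sketch using the built-in idna codec (which implements the older IDNA 2003 rules). The helper names are hypothetical, and this is only an illustration of the encoding, not of Forecheck's handling.

```python
def to_punycode(hostname):
    """Encode a Unicode host name to its ASCII (Punycode) form."""
    return hostname.encode("idna").decode("ascii")

def from_punycode(hostname):
    """Decode an ASCII (Punycode) host name back to Unicode."""
    return hostname.encode("ascii").decode("idna")

print(to_punycode("bücher.example"))          # xn--bcher-kva.example
print(from_punycode("xn--bcher-kva.example"))  # bücher.example
```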

Subdomain handling: Treat them as the analyzed domain (internal links) or treat them as an external domain.

Enter login data for protected sites/folders.

List of collected data

The following data is collected during a crawl. You can access all collected data in the sheet “All” within the Analysis tab. Some data is huge, which is why it is only visible in that sheet. All listed data is stored locally, and with the full text search function you can search within all of the data.

Data in red has not yet been implemented!
For more details regarding all of the data, please see the Forecheck Help!

URL: Gets an individual unique index.
Source: Where does that URL come from? Possible values:
FC: Through crawling (following links).
US: From the user, for example a URL list.
GA: Imported from Google Analytics.
WT: Imported from Webmaster Tools.
RK: From rankings in the search results.
Title: Meta-Tag Title.
Title Length: Length of title (marked green if length is ok or red if not).
Size: Size in kB (marked in yellow if quite small; configurable in the settings).
Status Code (Redirect? Not available? Ok?).
Redirect: Shows redirects and types of redirect like Meta Refresh, 3xx, including new location.
Indexability: Is that URL indexable, does it contain a session id or parameters?
Robots: Anything that hinders a crawler from indexing a URL can be seen here, including the reason (robots.txt, Meta-Tag robots).
Content-Type: The original content type.
Link Rel: Shows the link rel type of a URL that was found for the first time.
Internal Links: Number of internal outbound links of a page.
External Links: Number of all external outbound links of a page.
Inbound Links: Number of all internal incoming links (no backlinks from other domains except other subdomains, depending on the settings).
Outbound Links: Number of all outbound links (internal and external).
Language (HTTP Header): Language in the HTTP-Header.
Language (HTML Head): Language in the HTML Head.
Language (Content): Detected language of content, for supported languages see the list of features above.
Canonical Link: Canonical URL of a page, evaluated in the duplicate title/description/Hx/content reports.
Last modified: Server information about last date of modification.
Further information: More information, depending on the crawl results.
Pattern: Results of the search function (search during crawl function).
Page Rank: Page Rank of a page.
Load Time: Load time of a page (server connection time + download time, internally measured, different from metrics from Google Analytics, uses real-time references for rating! See help for all details).
Level: Shortest path (number of clicks) from the Start URL.
Meta Description
Meta-Description Length: Colors show if length is too short or too long.
H1-H4: Contains the content of all found H1 to H4 Tags. Multiple elements are separated by a double pipe/vertical line (||).
Link Juice: Internal value of the link popularity.
Text to Code Ratio: Ratio of the content (total chars) to source code length.
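
The Level value described above (shortest path in clicks from the Start URL) is what a breadth-first search over the internal link graph yields. The following Python sketch shows that technique on a hypothetical link map; it is an assumption about how such a value can be computed, not a description of Forecheck's internals.

```python
from collections import deque

def compute_levels(links, start_url):
    """Return the minimum number of clicks from start_url to each reachable page."""
    levels = {start_url: 0}
    queue = deque([start_url])
    while queue:
        page = queue.popleft()
        for target in links.get(page, []):
            # The first visit in a breadth-first search is always the shortest path.
            if target not in levels:
                levels[target] = levels[page] + 1
                queue.append(target)
    return levels

# Hypothetical internal link graph: page -> pages it links to.
links = {"/": ["/a", "/b"], "/a": ["/c"], "/b": ["/c", "/"], "/c": ["/d"]}
levels = compute_levels(links, "/")
print(levels)
```

Pages that cannot be reached from the Start URL do not appear in the result at all.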

The following columns appear only in the sheet “All” within the Analysis tab:

Content: The content of a page.
Content Total Chars: Number of chars in that content (after conversion to Unicode), not its length in bytes. Every char, including multibyte chars, is counted as one char, and each HTML entity also counts as one char.
Link Text Incoming Links: Contains all link text of incoming links and their frequency.
Link Text Outgoing Links: Contains all link text of all outgoing links.
HTTP-Header: Complete HTTP Header.
Source Code: Complete Source code of a page.
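
The difference between counting chars and counting bytes, and the Text to Code Ratio listed above, can be illustrated with a short Python sketch. The function names and the sample strings are made up for the example, and the ratio simply follows the description above (content chars divided by source code length).

```python
import html

def content_total_chars(raw_text):
    """Count chars after converting HTML entities to Unicode chars."""
    return len(html.unescape(raw_text))

def text_to_code_ratio(content, source_code):
    """Ratio of content chars to source code length (0.0 for empty source)."""
    return len(content) / len(source_code) if source_code else 0.0

sample = "Caf&eacute; &amp; B&uuml;cher 日本"
decoded = html.unescape(sample)  # "Café & Bücher 日本"

print(content_total_chars(sample))    # chars: every entity and multibyte char counts once
print(len(decoded.encode("utf-8")))   # bytes: larger, because of multibyte chars
print(text_to_code_ratio("Hello", "<p>Hello</p>"))
```

Note that Python's len counts Unicode code points, so a letter followed by a combining accent would count as two chars in this sketch.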

The following columns contain data from Google Analytics. There are many metrics you can choose from. Please see the Help for all available metrics.


List of reports

Here is a list of reports that are available. Some reports are just combined information of available data and filters. Also, some reports are only available through a combination of features and reports. Please search the Help file for more information on special tasks (for example, evaluating mobile redirects on a website).
All reports can be exported to CSV or Excel (keeping most of the color information). Forecheck evaluates all problems and rates how severe each one is. For details please see the Help.
Reports in red have not yet been implemented!

Broken Links -> 4xx internal: List, Page report or Link report.
Broken Links -> 4xx external: List, Page report or Link report.
Server Problems -> 5xx: List, Page report or Link report.
Server Problems -> Others: List, Page report or Link report.
Server Problems -> <200: List, Page report or Link report (all Status Code <200).
Redirects:
301 redirects internal
301 redirects external
302 redirects internal
302 redirects external
3xx redirects (Status code 303 to 399)
Meta-Refresh
Redirect chains (any combination of redirects, internal and external).
Mobile redirects: You can analyze all mobile redirects pertaining to your website. This requires some steps, please see the Help on how to perform this.
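
A redirect chain is a sequence of redirects in which one target redirects again. The Python sketch below follows such chains in a hypothetical map of already collected redirects and also detects loops. The function name, the map and the hop limit are assumptions for illustration only.

```python
def redirect_chain(redirects, url, max_hops=10):
    """Follow redirects starting at url; return (chain of URLs, loop detected)."""
    chain = [url]
    seen = {url}
    while url in redirects and len(chain) <= max_hops:
        url = redirects[url]
        chain.append(url)
        if url in seen:
            return chain, True  # the chain returned to a URL it already visited
        seen.add(url)
    return chain, False

# Hypothetical redirects collected during a crawl: source -> new location.
redirects = {
    "http://example.com": "https://example.com",
    "https://example.com": "https://example.com/",
    "https://example.com/x": "https://example.com/y",
    "https://example.com/y": "https://example.com/x",
}

chain, loop = redirect_chain(redirects, "http://example.com")
print(chain, loop)
```

A result with more than two entries in the chain means at least two consecutive redirects, which is the case a redirect chain report is about.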

4xx Error Handling
Timeouts: Can be set in the settings. Default is 20 seconds.
Load Time: relative value compared to real-time reference values, see Help for details.
HTTP Header: Evaluation check.

Title: Missing, too short, too long, duplicate title.
Description: Missing, too short, too long, duplicate description.

Hint: Multiple Titles/Descriptions are not implemented, because search engines only consider the first title/description and ignore any following elements.

Content: No content, short content and Duplicate Content.

Duplicate Title/Description/Content/Hx:
The Duplicate reports are unique since they also consider the Canonical Link URLs and show all problems based on 7 defined rules that define all types of Duplicate Content. This helps in identifying and solving such problems. The Help contains a great deal of information on this topic.

H1 missing: Pages with no H1.
Duplicate H1: Pages with identical H1.
Multiple H1: Pages with more than one H1.

Indexability: Analyzes the URL (parameters, session ID).

Robots information
Forecheck has a column that shows whether a crawler is prevented from crawling a URL (by a robots.txt entry or the meta-tag robots). Such URLs are marked with a red background color, and the column also displays the reason.
There are two reports that show details:
Blocked by robots: An entry in the robots.txt prevents crawling for the selected User Agent.
CSS and JS files that are blocked are shown as errors.
Pages with noindex: All pages that have a noindex in the meta-tag robots.
Missing canonical: All pages that have no Canonical Link URL.
Link Juice: Internal link popularity value, like page rank. It helps to identify bad internal link structure.
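
The check behind a "Blocked by robots" report can be approximated with Python's standard urllib.robotparser module. The sketch below is only an illustration: the robots.txt content, the user agent name and the URLs are invented, and Forecheck's own evaluation may differ.

```python
from urllib.robotparser import RobotFileParser

def blocked_urls(robots_txt, user_agent, urls):
    """Return the URLs that robots.txt disallows for the given user agent."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [u for u in urls if not parser.can_fetch(user_agent, u)]

# Hypothetical robots.txt content and crawl URLs.
robots_txt = """User-agent: *
Disallow: /private/
Disallow: /assets/js/
"""
urls = [
    "https://example.com/",
    "https://example.com/private/a.html",
    "https://example.com/assets/js/app.js",
]

blocked = blocked_urls(robots_txt, "ExampleBot", urls)
print(blocked)
```

In this example the blocked JS file is the kind of entry that the report above would show as an error.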

H2-H4 reports:
Hx missing: All pages with missing H2/H3/H4
Duplicate Hx: Pages with Duplicate H2/H3/H4
There are Duplicate H2/H3/H4 reports but also a Duplicate Hx report that shows any identical Hx element within H1, H2, H3 or H4.

Link Profile External: Shows all external links and their link text. Useful report for analyzing domains that have a backlink to your domain.

Are you missing something? There are many things in our pipeline, also see the future features. Feel free to contact us if you have a special request or question.