← Clarigital·Clarity in Digital Marketing
Content Marketing · Session 10, Guide 2

Content Audit & Gap Analysis · Optimise Before You Create

Most content programmes have a fundamental imbalance: they invest heavily in creating new content while largely ignoring the performance of content they already have. A content audit reverses this priority — systematically reviewing every piece of existing content to identify what is performing, what is underperforming and fixable, what should be consolidated, and what should be removed. A thorough content audit typically reveals more value in existing content than six months of new production would create. This guide walks through the complete audit process from inventory to action plan.

Content Marketing5,000 wordsUpdated Apr 2026

What You Will Learn

  • Why auditing existing content consistently outperforms creating new content in ROI
  • How to build a complete content inventory — even for large sites
  • The performance scoring framework that separates strong content from weak
  • How to categorise content into Keep, Improve, Consolidate, and Remove
  • Gap analysis — finding the topics and keywords you should be covering but aren't
  • How to build a prioritised action plan from audit findings
  • When refreshing existing content beats creating new content
  • The tools that make auditing efficient at scale
  • The most common findings in a content audit and what they mean

Why Audit Before Creating

The case for auditing before creating new content is straightforward: existing content has compounding advantages that new content lacks. A three-year-old blog post has indexed in search, accumulated backlinks, built up click-through-rate history in Google's systems, and established topical relevance for its URL. Refreshing that post to current standards — updating statistics, expanding thin sections, improving internal linking, and modernising the title — can produce ranking improvements in weeks that a new post competing on the same topic might take years to achieve.

Research from various SEO practitioners consistently shows that content refreshes to top-performing pages can increase organic traffic by 50–400% — making the ROI on audit-driven optimisation frequently higher than equivalent investment in new content creation. Yet most marketing teams spend the majority of their content budget on new creation and a fraction on existing content — the inverse of the optimal allocation.

A content audit also prevents a common inefficiency: creating new content that duplicates or competes with existing content. Without an inventory of what already exists, writers and strategists frequently propose content on topics the site already covers — sometimes ranking well — without awareness that they would be creating a competing page rather than filling a genuine gap. The audit prevents this duplication before it happens.

Traffic improvement

50–400%

Typical organic traffic increase from refreshing top-performing pages

Cannibalisation risk

Common

Sites with 100+ posts frequently have multiple pages competing for the same queries

Audit frequency

Annual

Full audit annually; quarterly review of top 20% performing pages

Building the Content Inventory

The content inventory is a spreadsheet listing every piece of content on your site — the raw material of the audit. For small sites (under 200 pages), this can be done manually. For larger sites, automated crawling tools are essential.

Data to capture per URL

Your inventory spreadsheet should have one row per URL and the following columns:

  • URL. The full page URL
  • Title / H1. The page title and primary heading
  • Content type. Blog post, landing page, product page, case study, guide, etc.
  • Content pillar. Which strategic topic pillar does this belong to?
  • Funnel stage. TOFU, MOFU, BOFU, or retention
  • Target keyword. The primary keyword this page targets
  • Word count. Approximate length
  • Publish date. When it was first published
  • Last updated. When it was most recently updated
  • Organic sessions (last 12 months). From Google Analytics
  • Average position (last 12 months). From Google Search Console
  • Impressions (last 12 months). From Google Search Console
  • Backlinks. Number of referring domains pointing to this URL
  • Conversions. Leads, sign-ups, or purchases attributed to this page

Automated inventory tools

For sites over 200 pages, use a crawler to extract all URLs: Screaming Frog SEO Spider, Sitebulb, or Ahrefs Site Audit all crawl your entire domain and export URLs with metadata. Connect the export to Google Analytics and Google Search Console data via API or manual export and merge on URL to populate the performance columns. This initial data collection takes 2–4 hours for a site of 500 pages and is the foundation of everything that follows.

Performance Scoring

Performance scoring assigns each piece of content a composite score based on its contribution to your content goals. A simple but effective scoring framework uses three dimensions:

Traffic score (0–10)

How much organic traffic does this page generate relative to your site average?

  • 10: Top 10% of pages by organic sessions
  • 7–9: Above average traffic
  • 4–6: Average traffic
  • 1–3: Below average; low traffic
  • 0: Zero or negligible organic traffic

Conversion score (0–10)

Does this page generate conversions (leads, sign-ups, purchases)?

  • 10: Top 10% of pages by conversion volume
  • 7–9: Above average conversions
  • 4–6: Some conversions
  • 1–3: Low conversion rate; rarely converts
  • 0: Zero conversions; no conversion tracking

Strategic value score (0–10)

How well does this content serve the content strategy — correct persona, correct funnel stage, correct pillar, current and accurate information?

  • 10: Core strategic piece; serves primary persona; accurate and current
  • 7–9: Valuable and relevant; minor updates needed
  • 4–6: Somewhat relevant; significant updating required
  • 1–3: Off-strategy; wrong audience or outdated
  • 0: Completely off-strategy; serves no current goal

Total score (out of 30) guides prioritisation: pages scoring 22+ are strong performers to protect and amplify; pages scoring 10–21 are candidates for improvement; pages scoring under 10 need significant intervention or removal.

Content Categorisation

Once scored, every piece of content receives one of four categories. The categories drive the action plan:

CategoryCriteriaAction
KeepHigh traffic, good conversions, strategically relevant — performing well with minimal issuesProtect: monitor, maintain internal links, ensure continued technical health. Consider amplification (promotion, link building).
ImproveStrategic value but underperforming — ranking positions 4–15, good impressions but low CTR, outdated statistics, thin content, missing sectionsRefresh: update statistics, expand thin sections, improve title and meta, add internal links, improve CTAs
ConsolidateMultiple pages covering the same topic and competing with each other (keyword cannibalisation); or several thin pages that together could form one comprehensive guideMerge: combine content into one comprehensive page; 301 redirect thin pages to the consolidated URL
RemoveNo traffic, no conversions, no backlinks, off-strategy, duplicate content, or outdated information that damages credibilityDelete or redirect: 301 redirect to the most relevant remaining page; do not simply delete without redirecting if any backlinks exist
Never delete without checking backlinks first

Before removing any page, check its backlink profile in Ahrefs or Google Search Console. A page with zero traffic but 15 referring domains pointing to it has link equity worth preserving. 301 redirect it to the most topically relevant remaining page rather than returning a 404 — this passes the link equity to a page that can benefit from it.

Gap Analysis

Gap analysis identifies the topics, questions, and keywords your audience is searching for that your content does not currently cover. It is the bridge between auditing existing content and planning new content.

Methods for identifying content gaps

  • Keyword gap analysis. Compare your keyword rankings against those of your top 3 competitors using tools like Ahrefs, Semrush, or Google Search Console. Keywords where competitors rank in the top 10 but you have no ranking or ranking below position 20 are content gaps. Prioritise gaps on keywords with commercial intent and achievable difficulty.
  • Google Search Console query analysis. Filter GSC queries where your average position is 11–30 (page 2 and 3) — these are topics you have some relevance for but insufficient depth. You have a page that partially covers the topic; a more comprehensive page would likely rank higher.
  • People Also Ask analysis. For your primary topic areas, review the "People Also Ask" boxes in Google search results. Questions appearing there that you do not have content answering are gap opportunities — they are questions Google has confirmed real users are asking.
  • Customer question mining. Review your sales call transcripts, support tickets, and community forum questions. What questions do prospects and customers ask that you do not have content to answer? This type of gap is often more commercially valuable than keyword gaps because the questions come directly from your target audience.
  • Funnel stage gap analysis. Map your content inventory against your content funnel framework. Where are the gaps by funnel stage? Most programmes have plenty of TOFU content and insufficient MOFU and BOFU — identifying this imbalance guides prioritisation.

The Action Plan

The audit produces a large list of potential actions — more than any team can execute simultaneously. Prioritisation is essential. A practical prioritisation framework ranks actions by: expected impact (traffic/conversion improvement); effort required; and strategic importance.

Action plan priority tiers

PriorityActionsRationale
Tier 1 — Immediate (Month 1)Fix technical issues on high-traffic pages; consolidate cannibalising pages; remove clearly off-strategy contentQuick wins; prevent ongoing damage; low risk
Tier 2 — Short-term (Months 2–3)Refresh top 10 pages by organic traffic; improve CTR on high-impression/low-CTR pagesHigh leverage; existing pages with authority; improvements compound quickly
Tier 3 — Medium-term (Months 3–6)Create new content for highest-priority gaps; improve MOFU/BOFU deficitBuilds coverage where strategy demands it
Tier 4 — OngoingSystematic improvement of remaining Improve-category content; monitoring Keep categorySustained programme of incremental improvement

Refresh vs New: Decision Framework

The decision whether to refresh an existing page or create a new page for a given topic is one of the most important strategic decisions in content management. The default should be refresh — existing pages have accumulated advantages that new pages lack. Create a new page only when the topic is genuinely not covered by any existing page.

SituationDecisionReason
Existing page ranking 5–20 for target keywordRefreshPage already has relevance signal; refresh will improve ranking faster than a new page
Existing page ranking 20+ with low impressionsRefresh or newAssess whether the existing page has meaningful backlinks; if so, refresh; if not, evaluate creating a stronger new page
Topic not covered anywhere on siteNewNo existing page to leverage; gap requires new content
Multiple thin pages on related subtopicsConsolidate into new comprehensive pageOne strong page outperforms multiple weak ones; consolidation gathers the link equity
Existing page has significant backlinks but low trafficRefreshThe backlinks have value; refreshing capitalises on existing link equity

Audit Cadence

Content audits should be conducted on a regular cadence — not as a one-time project. The recommended schedule:

  • Full audit: annually. A complete inventory and scoring of all content. Produces the annual content plan. Typically takes 1–2 weeks for a site of 200–500 pages.
  • Quarterly review of top 20%. Review the top 20% of pages by organic traffic quarterly. Are rankings stable? Have statistics become outdated? Are there new sections to add? This prevents gradual decay of your most valuable pages.
  • Monthly GSC review. Review Google Search Console monthly for position changes, CTR anomalies, and query changes on your top pages. Early identification of ranking declines enables proactive refresh before traffic drops significantly.

Tools for Content Auditing

ToolPurposeCost
Google Search ConsoleSearch performance data — impressions, clicks, positions, CTR per URLFree
Google Analytics 4Traffic data, engagement metrics, conversion data per URLFree
Screaming Frog SEO SpiderFull site crawl — extracts all URLs, titles, meta, word counts, status codesFree up to 500 URLs; paid for larger sites
Ahrefs Site AuditTechnical audit + backlink data per URL + keyword rankingsPaid
Semrush Content AuditAutomated content audit pulling in traffic and social dataPaid
Google Sheets / AirtableInventory management, scoring, action plan trackingFree / Freemium

The free tool stack (Screaming Frog free tier for small sites, GA4, Google Search Console) is sufficient for sites under 500 pages. Paid tools (Ahrefs, Semrush) add backlink data and keyword tracking that significantly accelerates gap analysis and prioritisation for larger sites.

Common Audit Findings

The same patterns appear in almost every content audit. Knowing what to expect helps you find them efficiently:

  • Keyword cannibalisation. Multiple pages competing for the same primary keyword — preventing either from ranking strongly. Very common on sites with large content libraries created without a centralised keyword strategy. Fix by consolidating into one comprehensive page.
  • Top-10 pages with outdated statistics. Your best-ranking posts often contain statistics from 3–5 years ago. Readers notice; competitors may have more current data. Refreshing statistics on high-traffic pages is often the highest-leverage single improvement available.
  • Orphaned content. Pages with no internal links pointing to them — invisible to both search engines and users. Identify pages with zero internal links and add relevant links from related content.
  • Missing CTAs. High-traffic blog posts with no calls to action — generating reads but no conversions. Adding contextually relevant CTAs (email sign-up, related guide, free trial) to existing high-traffic pages can dramatically increase their lead generation value.
  • Thin content ranking in positions 8–15. Pages with 400–800 words ranking on page 1 but not in the top 5 — lacking the depth to compete with more comprehensive pages. Expanding these is typically a 2–4 week project that moves them from position 10 to position 3–5.
  • Broken internal links. Internal links pointing to redirected or deleted URLs. These create unnecessary redirect chains that dilute link equity. Fix with consistent monitoring via Screaming Frog.

Authentic Sources

OfficialGoogle Search Central — Helpful Content

Google's guidance on evaluating content quality — the framework underlying content audit decisions.

OfficialGoogle Search Console

Primary data source for content audit performance metrics — impressions, clicks, positions.

OfficialGoogle Analytics 4

Traffic and conversion data for content performance scoring.

OfficialGoogle Search Central — Consolidating Duplicate URLs

Google's guidance on handling duplicate content and consolidation with canonical and 301 redirects.

600 guides. All authentic sources.

Official documentation only.