Semantically Similar
What This Means
Pages whose content is semantically similar, as measured by cosine similarity against the configured threshold. High semantic similarity can be completely normal, but it can also indicate duplicate or overlapping content that should be reviewed.
What Triggers This Issue
This issue is triggered when pages are semantically similar in content, based on the configured cosine similarity threshold (0.95 by default).
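To make the threshold concrete, here is a minimal sketch of how a cosine similarity check between pages could work. It is not the tool's actual implementation: the function names, the sample URLs, the three-dimensional vectors, and the choice of `>=` for the comparison are all assumptions for illustration. Real page embeddings typically have hundreds or thousands of dimensions.

```python
import math
from itertools import combinations


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat an all-zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


def find_similar_pages(embeddings, threshold=0.95):
    """Return (url_a, url_b, score) for each pair at or above the threshold.

    `embeddings` is a hypothetical mapping of URL -> embedding vector.
    Whether the real check is inclusive (>=) is an assumption here.
    """
    flagged = []
    for (url_a, vec_a), (url_b, vec_b) in combinations(embeddings.items(), 2):
        score = cosine_similarity(vec_a, vec_b)
        if score >= threshold:
            flagged.append((url_a, url_b, score))
    return flagged


# Made-up embeddings: two near-identical product pages and one unrelated page.
pages = {
    "/red-shoes": [0.90, 0.10, 0.30],
    "/crimson-shoes": [0.88, 0.12, 0.31],
    "/contact": [0.05, 0.95, 0.10],
}

flagged = find_similar_pages(pages)
for url_a, url_b, score in flagged:
    print(f"{url_a} <-> {url_b}: {score:.4f}")
```

In this sketch only the two shoe pages are flagged, because their vectors point in almost the same direction. Lowering the threshold would flag more loosely related pairs, while raising it would restrict the report to near-duplicates.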
How To Fix
Review semantically similar pages to confirm that each one should exist as a standalone page. Check that they are not duplicates and do not cover the same subject multiple times, which can cause cannibalisation issues or crawling and indexing inefficiencies. Highly semantically similar content is normal when pages cover closely related subjects. Leave pages as they are where appropriate; otherwise consider making them more unique, or consolidating, blocking, or removing them. Also consider internal linking opportunities between highly semantically similar pages.