Semantic Keyword Clustering

Do you frequently question if two keywords can be targeted together on a page, or struggle with a large list of keywords that ChatGPT or other tools can't cluster due to token limits or cost?

What Are the Differences Between SEO Utils' Keyword Clustering and Other Tools?

Here are 2 of the main differences:

Flexible to Switch the Embedding Model

Embedding models in Natural Language Processing (NLP) are designed to convert words, phrases, sentences, or entire documents into numerical vectors. These vectors represent the linguistic features of the text, allowing machines to process and analyze language in a meaningful way.
To do keyword clustering well, you need a good model that's already been trained. With AI growing fast, new models are coming out almost every day. You can visit HuggingFace, a website, to get a free model and use it with SEO Utils to find one that's best for your type of business.
You can also take one of these models and train it more on words specific to your niche or industry. Then, use this customized model in SEO Utils for even better keyword clustering, which can improve your SEO results.

Unlimited Keywords for Clustering

With SEO Utils, you're not restricted in the number of keywords you can cluster. This is a big advantage over other tools that limit you to clustering between 5,000 to 10,000 keywords at a time. Since SEO Utils runs on your computer, it can handle as many keywords as you need, going way beyond these limits.
There's also no credit-based system, meaning you don’t have to pay extra no matter how many keywords you cluster. This can mean big savings, especially in large niches like Gym or Fitness where you might need to cluster a million keywords.
You might think, "Can't I just cluster keywords with ChatGPT or the OpenAI API?" While it's true you can cluster a few hundred keywords with these tools, they hit a limit when you try more than 10,000 keywords due to token limitations. Even with GPT-4 Turbo, which allows more tokens, the quality of clustering decreases with more keywords. It often loses context, doesn't follow instructions well, and misses keywords because you cannot control the temperature parameter in ChatGPT. You can do it with OpenAI API, but the cost is too high.
That's where a dedicated keyword clustering tool like SEO Utils makes a big difference.

Semantic Clustering vs SERP Clustering

In my experience, SERP Clustering always gives you the best result of clustering. However, it comes with many technical issues like proxy rotation, time-consuming, server resources, etc.
Take Larseo's SERP Clustering, for example. It lets you cluster unlimited keywords, but clustering 1 million keywords takes a really long time and can cost about $2,900 (at 0.5 credit per keyword).
On the other hand, using the Semantic Clustering feature in SEO Utils is a different story. You don't have to pay extra, and you can get results as good as SERP Clustering. You can achieve this by fine-tuning your model to suit your specific needs.
SEO Utils will support fine-tuning soon!

How to Download Embedding Models and Use It on SEO Utils?

SEO Utils - How to use Semantic Keyword Clustering
  1. 1.
    First, you can visit this leaderboard: https://huggingface.co/spaces/mteb/leaderboard
  2. 2.
    Click on the "Clustering" tab, and then select the language that matches your keywords.
List of top embedding models
  1. 3.
    You will see the top embedding models based on their clustering task performance.
  2. 4.
    Select one model, for example, https://huggingface.co/thenlper/gte-large
Only select the mode that can be used with Sentence Transformers.
Example a model can be used with Sentence Transformers.
  1. 5.
    Click on the Clone repository to download a model with GIT git-lfs
Download a model
# Make sure you have git-lfs installed (https://git-lfs.com)
git lfs install
git clone https://huggingface.co/thenlper/gte-large
# if you want to clone without large files – just their pointers
# prepend your git clone with the following env var:
GIT_LFS_SKIP_SMUDGE=1
I will provide a list of popular models on Google Drive so that you can easily download them.
  1. 5.
    After downloading a model, open SEO Utils on your machine.
  2. 6.
    Click on the App dropdown, and go to the Settings page.
  1. 7.
    Scroll down to the Keyword Clustering section and enter the path to the downloaded model on your machine. Then hit the Save button.
  1. 8.
    That's all. Now, you can go to the Keyword Clustering page and kick off the process.
English
Dutch
Spanish
Japanese
Filipino
Chinese
Thai
Vietnamese
Arabic
Indonesia
Universal
These models are pre-trained in multiple languages. If you cannot find a model that is pre-trained in a specific language, you can use these universal models. sentence-transformers / all-mpnet-base-v2