✨Semantic Keyword Clustering

Do you frequently question if two keywords can be targeted together on a page, or struggle with a large list of keywords that ChatGPT or other tools can't cluster due to token limits or cost?

What Are the Differences Between SEO Utils' Keyword Clustering and Other Tools?

Here are 2 of the main differences:

Flexible to Switch the Embedding Model

Embedding models in Natural Language Processing (NLP) are designed to convert words, phrases, sentences, or entire documents into numerical vectors. These vectors represent the linguistic features of the text, allowing machines to process and analyze language in a meaningful way.

To do keyword clustering well, you need a good model that's already been trained. With AI growing fast, new models are coming out almost every day. You can visit HuggingFace, a website, to get a free model and use it with SEO Utils to find one that's best for your type of business.

You can also take one of these models and train it more on words specific to your niche or industry. Then, use this customized model in SEO Utils for even better keyword clustering, which can improve your SEO results.

Unlimited Keywords for Clustering

With SEO Utils, you're not restricted in the number of keywords you can cluster. This is a big advantage over other tools that limit you to clustering between 5,000 to 10,000 keywords at a time. Since SEO Utils runs on your computer, it can handle as many keywords as you need, going way beyond these limits.

There's also no credit-based system, meaning you don’t have to pay extra no matter how many keywords you cluster. This can mean big savings, especially in large niches like Gym or Fitness where you might need to cluster a million keywords.

You might think, "Can't I just cluster keywords with ChatGPT or the OpenAI API?" While it's true you can cluster a few hundred keywords with these tools, they hit a limit when you try more than 10,000 keywords due to token limitations. Even with GPT-4 Turbo, which allows more tokens, the quality of clustering decreases with more keywords. It often loses context, doesn't follow instructions well, and misses keywords because you cannot control the temperature parameter in ChatGPT. You can do it with OpenAI API, but the cost is too high.

That's where a dedicated keyword clustering tool like SEO Utils makes a big difference.

Semantic Clustering vs SERP Clustering

In my experience, SERP Clustering always gives you the best result of clustering. However, it comes with many technical issues like proxy rotation, time-consuming, server resources, etc.

Take Larseo's SERP Clustering, for example. It lets you cluster unlimited keywords, but clustering 1 million keywords takes a really long time and can cost about $2,900 (at 0.5 credit per keyword).

On the other hand, using the Semantic Clustering feature in SEO Utils is a different story. You don't have to pay extra, and you can get results as good as SERP Clustering. You can achieve this by fine-tuning your model to suit your specific needs.

SEO Utils will support fine-tuning soon!

How to Download Embedding Models and Use It on SEO Utils?

This documentation is for Semantic Clustering v1. For a better experience and improved performance, please refer to the documentation for Version 2.

Also, version 1 is not available for Linux.

  1. First, you can visit this leaderboard: https://huggingface.co/spaces/mteb/leaderboard

  2. Click on the "Clustering" tab, and then select the language that matches your keywords.

  1. You will see the top embedding models based on their clustering task performance.

  2. Select one model, for example, https://huggingface.co/thenlper/gte-large

Only select the mode that can be used with Sentence Transformers.

  1. Click on the Clone repository to download a model with GIT git-lfs

# Make sure you have git-lfs installed (https://git-lfs.com)
git lfs install
git clone https://huggingface.co/thenlper/gte-large

# if you want to clone without large files – just their pointers
# prepend your git clone with the following env var:
GIT_LFS_SKIP_SMUDGE=1

I will provide a list of popular models on Google Drive so that you can easily download them.

  1. After downloading a model, open SEO Utils on your machine.

  2. Click on the App dropdown, and go to the Settings page.

  1. Scroll down to the Keyword Clustering section and enter the path to the downloaded model on your machine. Then hit the Save button.

  1. That's all. Now, you can go to the Keyword Clustering page and kick off the process.

English
Dutch
Swedish
Spanish
Japanese
Filipino
Chinese
Thai
Vietnamese
Arabic
Indonesia
Universal

These models are pre-trained in multiple languages. If you cannot find a model that is pre-trained in a specific language, you can use these universal models. sentence-transformers / all-mpnet-base-v2

Updated Nov 26, 2024: Semantic Keyword Clustering v2

With the release of SEO Utils v1.23.2, I’ve introduced Semantic Keyword Clustering v2, which leverages a consistent Docker environment for running the clustering Python script. Here’s how this upgrade improves your experience:

  • Streamlined Model Selection: No more manual downloads from HuggingFaceβ€”simply select your model from a dropdown menu, and SEO Utils takes care of the rest.

  • Faster Clustering with GPU Support: Speed up keyword clustering with GPU support for Windows and Linux. While macOS only supports CPU, version 2 still delivers faster clustering compared to version 1!

  • Polished Output: All formatting issues in the output file have been resolved.

  • Improved Reliability: Minimize unexpected issues with a more stable setup.

  • Simplified Updates: Updating the clustering script is now easier for me, as I no longer need to build separate executable scripts for each platform.

Why do I still keep version one? Some users run SEO Utils on VPS, and not all VPS can support Docker. That’s why version 1 is still available as a backup option. If your setup supports Docker, I highly recommend using version 2!

How to Install Docker

  • Visit https://www.docker.com/, download it, and install it as you normally would.

  • Open Docker and follow the app’s instructions to start it as recommended.

You can close Docker if you’re not using Semantic Clustering version 2 anymore.

Switch to Version 2

Step 1: Go to Settings > Services from the left sidebar.

Step 2: Select Version 2 from the dropdown and click the Save button.

Step 3: Open the Semantic Clustering tool as you normally would. You’ll notice a new field called β€œEmbedding Model”. Simply choose a model from the dropdown based on your language to start clustering keywords.

No more complicated setups or manual downloads! SEO Utils will automatically download the model and cache it for future use, so you won’t need to wait for it to re-download every time.

Last updated