β¨Semantic Keyword Clustering
Do you frequently question if two keywords can be targeted together on a page, or struggle with a large list of keywords that ChatGPT or other tools can't cluster due to token limits or cost?
What Are the Differences Between SEO Utils' Keyword Clustering and Other Tools?
Here are 2 of the main differences:
Flexible to Switch the Embedding Model
Embedding models in Natural Language Processing (NLP) are designed to convert words, phrases, sentences, or entire documents into numerical vectors. These vectors represent the linguistic features of the text, allowing machines to process and analyze language in a meaningful way.
To do keyword clustering well, you need a good model that's already been trained. With AI growing fast, new models are coming out almost every day. You can visit HuggingFace, a website, to get a free model and use it with SEO Utils to find one that's best for your type of business.
You can also take one of these models and train it more on words specific to your niche or industry. Then, use this customized model in SEO Utils for even better keyword clustering, which can improve your SEO results.
Unlimited Keywords for Clustering
With SEO Utils, you're not restricted in the number of keywords you can cluster. This is a big advantage over other tools that limit you to clustering between 5,000 to 10,000 keywords at a time. Since SEO Utils runs on your computer, it can handle as many keywords as you need, going way beyond these limits.
There's also no credit-based system, meaning you donβt have to pay extra no matter how many keywords you cluster. This can mean big savings, especially in large niches like Gym or Fitness where you might need to cluster a million keywords.
You might think, "Can't I just cluster keywords with ChatGPT or the OpenAI API?" While it's true you can cluster a few hundred keywords with these tools, they hit a limit when you try more than 10,000 keywords due to token limitations. Even with GPT-4 Turbo, which allows more tokens, the quality of clustering decreases with more keywords. It often loses context, doesn't follow instructions well, and misses keywords because you cannot control the temperature
parameter in ChatGPT. You can do it with OpenAI API, but the cost is too high.
That's where a dedicated keyword clustering tool like SEO Utils makes a big difference.
Semantic Clustering vs SERP Clustering
In my experience, SERP Clustering always gives you the best result of clustering. However, it comes with many technical issues like proxy rotation, time-consuming, server resources, etc.
Take Larseo's SERP Clustering, for example. It lets you cluster unlimited keywords, but clustering 1 million keywords takes a really long time and can cost about $2,900 (at 0.5 credit per keyword).
On the other hand, using the Semantic Clustering feature in SEO Utils is a different story. You don't have to pay extra, and you can get results as good as SERP Clustering. You can achieve this by fine-tuning your model to suit your specific needs.
SEO Utils will support fine-tuning soon!
How to Download Embedding Models and Use It on SEO Utils?
This documentation is for Semantic Clustering v1. For a better experience and improved performance, please refer to the documentation for Version 2.
Also, version 1 is not available for Linux.
First, you can visit this leaderboard: https://huggingface.co/spaces/mteb/leaderboard
Click on the "Clustering" tab, and then select the language that matches your keywords.
You will see the top embedding models based on their clustering task performance.
Select one model, for example, https://huggingface.co/thenlper/gte-large
Only select the mode that can be used with Sentence Transformers.
Click on the Clone repository to download a model with GIT
git-lfs
I will provide a list of popular models on Google Drive so that you can easily download them.
After downloading a model, open SEO Utils on your machine.
Click on the App dropdown, and go to the Settings page.
Scroll down to the Keyword Clustering section and enter the path to the downloaded model on your machine. Then hit the Save button.
That's all. Now, you can go to the Keyword Clustering page and kick off the process.
Popular Models
Updated Nov 26, 2024: Semantic Keyword Clustering v2
With the release of SEO Utils v1.23.2, Iβve introduced Semantic Keyword Clustering v2, which leverages a consistent Docker environment for running the clustering Python script. Hereβs how this upgrade improves your experience:
Streamlined Model Selection: No more manual downloads from HuggingFaceβsimply select your model from a dropdown menu, and SEO Utils takes care of the rest.
Faster Clustering with GPU Support: Speed up keyword clustering with GPU support for Windows and Linux. While macOS only supports CPU, version 2 still delivers faster clustering compared to version 1!
Polished Output: All formatting issues in the output file have been resolved.
Improved Reliability: Minimize unexpected issues with a more stable setup.
Simplified Updates: Updating the clustering script is now easier for me, as I no longer need to build separate executable scripts for each platform.
Why do I still keep version one? Some users run SEO Utils on VPS, and not all VPS can support Docker. Thatβs why version 1 is still available as a backup option. If your setup supports Docker, I highly recommend using version 2!
How to Install Docker
Visit https://www.docker.com/, download it, and install it as you normally would.
Open Docker and follow the appβs instructions to start it as recommended.
You can close Docker if youβre not using Semantic Clustering version 2 anymore.
Switch to Version 2
Step 1: Go to Settings > Services from the left sidebar.
Step 2: Select Version 2 from the dropdown and click the Save button.
Step 3: Open the Semantic Clustering tool as you normally would. Youβll notice a new field called βEmbedding Modelβ. Simply choose a model from the dropdown based on your language to start clustering keywords.
No more complicated setups or manual downloads! SEO Utils will automatically download the model and cache it for future use, so you wonβt need to wait for it to re-download every time.
Last updated