Hello,
The feature “Ignore these words while comparing” is not works,
Sorry for my English.
Thanks you !
After upgrading to WordPress 4.5.2, this plugin does not work.
]]>In my blog i have thousands messages and i can’t compare it each day , cause the excessive cpu usage. So i can run it only each 2/3 weeks and it take a very long time.
Why don’t add the possibility to execute it in the cron job? If it run continuously (for example one time each hour), doesn’t load the cpu and the results are always updated.
Thanks
]]>Is it possible to search duplicates on pending or draft posts, not just on published articles?
thanks
]]>I started using this plugin about 18 months ago as a means for “catching” duplicate post titles from publishing. On my blog, I schedule several hundred posts at a time that drip out over the course of a week or so.
This plugin used to be very effective at catching any “near duplicate” titles and then putting them in a “pending status”.
It seems that this functionality no longer works. Is there any way I can still use this plugin to monitor a scheduled post at publish time and send it to a pending status if the title is more than 95% the same as an existing, published post title?
Thanks!
]]>Is it possible to filter the post search by a specific category? I have a huge number of posts, and the search will last for a very long time ??
]]>Would love to be ablel to select multiple item types in the left hand column for comparison, ESPECIALLY comparing posts and pages at the same time. Or am i missing something?
]]>Hi,
The plugin works really nice when I put some similar articles and trigger manual finding.
But I have some question with the settings page of Find duplicates.
First, after I active the automatic duplicate handling, how do I know when the auto-handling will happen ? Can I customize the time ?
Second, what dose the “Manual duplicate handling” on settings page mean ? It’s there already another manual tool in the tools page ?
Come on what’s this?
I filtered the content by 80%,60% and even 10%. I received variations by three to six 100% similar content messages.
How to fix this?
]]>Correct me if I’m wrong, but I examined the code (after it failed to retrieve the *exactly duplicated* posts) and found out what the plugin does is basically get ALL of your posts in the database, and run through ALL of them in a loop, breaking the words and compare them one by one with each run. Being a WP plugin author myself, this isn’t exactly something I would do.
My site has ~6000 of posts right now, and this causes a really memory issue. If you have a relative big site, you may want to reconsider about using it. Otherwise, it still works great.
]]>Hi Markus,
I really love your plugin, it’s great!
However, I just noticed recently that this plugin causes high I/O usage every time I run it.
When it starts checking for dupes, the I/O usage jumps to the max limit, when I stop the process, the I/O usage drops to zero.
Is there a way to lower the I/O usage so I do not give heavy burden to my hosting?
FYI, my hosting allows up to 5120 KB/s of I/O usage, so when the plugin is running, the I/O usage is 5120/5120 KB/s
Thanks.
https://www.ads-software.com/extend/plugins/find-duplicates/
]]>First, thank you for the great plugin.
Just an issue I thought you should be aware of. After updating to WordPress 3.5.2 I tried to continue an old search.
I am seeing this error on the right hand column under include these statuses of the tools>find duplicates page
“in_array() [function.in-array]: Wrong datatype for second argument in /mysite.com/wp-content/plugins/find-duplicates/tpl/search.php on line 43”
same page between the two boxes (search settings and log) I get
“Warning: Invalid argument supplied for foreach() in /mysite.com/wp-content/plugins/find-duplicates/tpl/search.php on line 123”
I saw similar errors on the settings>find duplicates page, unfortunately did not record the exact error. I was able to remove them on that page by entering a new set of settings and pressing save.
I think it was due to me being “mid search” when I upgraded wp. Any ideas or suggestions? If no, thanks again anyway and great tool.
https://www.ads-software.com/extend/plugins/find-duplicates/
]]>Hallo Markus,
ich finde dein Plugin ist eine absolute Wohltat und das Beste weit und breit um duplictes zu filtern. Alle anderen funktionieren irgendwie überhaupt nicht sonderlich gut bis gar nicht.
Echt eine super Arbeit, die Du da zur Verfügung gestellt hast. Wenn Du in allem so gründlich arbeitest, gibt’s mit Sicherheit demn?chst noch ein paar Auftr?ge meinerseits.
Aber nun genug des Honigs (-;
Ich h?tte da mal eine Frage zum Plugin:
Ich nutze dies um auf einer Jobsuchseite doppelt und dreifaches (manchmal auch mehr …) auszusortieren. Ich arbeite derzeit haupts?chlich noch mit XML Importen, die haufenweise Mehrfaches erzeugen.
Nun werden die duplicates ja in “Ausstehend” bzw. “Pending” geschoben.
Da ich die dort gelagerten aber absolut nicht gebrauchen kann, w?re es für meinen Fall von Vorteil, dass die gleich in den Papierkorb wandern.
Denn bei mehreren Hunderten doppelter Eintr?ge t?glich, ist es recht zeitaufwendig maximal 100 je Vorgang (mehr will er einfach nicht) von “Ausstehend” zun?chst in den Papierkorb zu schieben und dann erst l?schen zu k?nnen.
Was müsse ich wo in welcher Zeile ?ndern um das hinzubekommen?
Ich w?re super glücklich, wenn das so funktionieren würde. Noch schicker w?re es sogar, wenn die gleich komplett von der Bildfl?che verschwinden – also nicht einmal im Papierkorb landen.
Dies sollte sich idealerweise aber nur auf den automatischen Abgleich bei neu eintreffen doppelten Eintr?gen beziehen. Bei manuell k?nnen die gern auf “Pending” bleiben.
W?re das durchführbar?
Für deine Antwort bereits lieben Dank vorab.
https://www.ads-software.com/extend/plugins/find-duplicates/
]]>I am getting this error:
Fatal error: Maximum execution time of 30 seconds exceeded in /home/content/61/9608761/html/wp-content/plugins/find-duplicates/find-duplicates.php on line 234
:SyntaxError: JSON.parse: unexpected character
What is this and how to get rid of same?
https://www.ads-software.com/extend/plugins/find-duplicates/
]]>Hello!
I like this plugin a lot. My client had some duplicate Pages and I run this plugin to search for duplicate titles. But I was surprised that it didn’t find any duplicates when it was obvious duplicates existed. It turned out that some titles had “html entities” while other had the real characters, for example: " vs “.
It seems to me that it might be useful to add a call to html_entity_decode on lines 143 and 144:
$post_compare = html_entity_decode( $post_compare );
$post2_compare = html_entity_decode( $post2_compare );
Cheers!
https://www.ads-software.com/extend/plugins/find-duplicates/
]]>Hey, Markus,
i have some posts with _almost_ duplicated titles and it would be great to exclude some words while comparing.
I will give you example. I have post titles:
a) CarMake1 KEYWORD – Carmodel1
b) KEYWORD CarMake1 – Carmodel1
c) KEYWORD Carmake1 – Carmodel2
Results of searching duplicates now would be like (just as example, not real numbers):
a>b 80%
a>c 80%
b>a 80%
b>c 80%
etc.
But if i would exclude KEYWORD (or maybe couple of keywords) in post titles from calculation, my array becomes this:
a) CarMake1 – Carmodel1
b) CarMake1 – Carmodel1
c) Carmake1 – Carmodel2
And i would see that a and b are 100% dupes and can delete them without checking.
So feature to exclude keywords from calculation of match by titles would be very handy!
Thank you
https://www.ads-software.com/extend/plugins/find-duplicates/
]]>One click delete button of older or newer posts it’s great, however, sometimes you might want to delete several results from the found duplicates list.
Would it be possible to add an individual javascript delete button next to each post in the list or perhaps a select box and one submit button?
Thanks.
https://www.ads-software.com/extend/plugins/find-duplicates/
]]>Great plugin!
I store quite a lot of information in the form of custom fields which would be a really good source of information for duplicate post detection as the data is structured.
I’m wondering whether it would be possible to add support for this?
Thanks
https://www.ads-software.com/extend/plugins/find-duplicates/
]]>Hey, I’ve noticed that there are false positives in some of my runs, and if I raise the percentage threshold high enough to exclude them I also find myself excluding legitimate duplicates.
I was wondering if it would be possible to add a button next to each entry in the list of duplicates to remove that entry?
Thanks for the plugin!
https://www.ads-software.com/extend/plugins/find-duplicates/
]]>Hello, do you think its possible to check both content and title on an automatic submission?
It doesn’t need to be combined but two separate filters at the same time.
1. go through title filter
2. go through content filter
3. if both pass, go publish. Else go pending for a review.
Right now I filter content and it works great but I would love to filter the duplicate titles in the same run.
Thanks a lot for the great work you put into this.
https://www.ads-software.com/extend/plugins/find-duplicates/
]]>Greetings.
First of all, I’d like to say thank you for the very useful plugin.
Secondly, there is an issue if there are multiple duplicates of the same post.
For example, let’s say you have a script that posts from an RSS feed and it automatically posts the same article 11 times.
You run the plugin, and it picks up the 11 posts, if you hit ‘delete old posts’ the log will look like this:
Deleted: 9195
Deleted: 9195
Deleting 9195 failed.
Deleting 9195 failed.
Deleting 9195 failed.
Deleting 9195 failed.
Deleting 9195 failed.
Deleting 9195 failed.
Deleting 9195 failed.
Deleting 9195 failed.
Then, when you look in the trash posts under ‘all posts’ none of the posts will be there, they appear to have been permanently deleted.
I’d speculate that the first use of the delete function moves the post to trash, the second deletes forever and after that it starts throwing errors.
I hope this information helps!
https://www.ads-software.com/extend/plugins/find-duplicates/
]]>Hi!
Great Plugin!
But I missed the functiuon to delete the older posts instead of the newer.
Any chance to do this?
Regards,
Kai
https://www.ads-software.com/extend/plugins/find-duplicates/
[No bumping, thank you.]
]]>Just one question.
Could you explain in a litter more detail the “Filter out HTML-Tags while comparing” option?
Thanks for the great plugin!
Steve
https://www.ads-software.com/extend/plugins/find-duplicates/
]]>Hey. This is a very great plugin. I tested it out and it works great.
Now, I want to run this as a Cron job. Find duplicates and delete automatically. I know its risky but setting the percentage to high should be safe enough.
Anyhow, I attempted, but the it only runs for the first post. It looks like I have loop and pass in the already processed posts.
Has anybody tried?
[Code moderated as per the Forum Rules. The maximum number of lines of code that you can post in these forums is ten lines. Please use the pastebin]
https://www.ads-software.com/extend/plugins/find-duplicates/
]]>Is a great plugin with high accuracy, but I think you need to add another filter to search by categories, because I have categories where are text posts and categories where are only posts with photos without words(only title and some tags), and they are considered as duplicates with the percentage of 80% to other posts.
Here is a printscreen: https://www.yeti.ro/printscreen.png
First 11 rows are ok but after that in right column is a photo post and in left column are posts with text.
https://www.ads-software.com/extend/plugins/find-duplicates/
]]>I have over 200+ WP blogs, and have been in constant contact with the plug in creator Markus, who I might add has been EXCELLENT in feedback and assisting me when I have issues.
He emailed me to let me know he has updated the plugin and I LOVE the update. I can now delete all found duplicates with a single click. It moves the duplicate posts to the trash for me to review and ensure they are indeed a duplicate.
Currently I would recommend this plug in if you are looking to remove duplicate posts to ensure your not targeted by Panda.
Great Job Markus, keep the updates coming.
-Michael
https://www.ads-software.com/extend/plugins/find-duplicates/
]]>While the intent and format of this plugin are excellent, it failed to return any results even though I had duplicate posts that were absolutely identical in every manner. Hopefully, this will be fixed soon as it seems a promising option for sites with large volumes of content.
]]>