We want to calculate the similarity of several thousands of texts. The number of texts can go upto 100 K. Each text is in 1 .txt file and each file is named with a number: [login to view URL], [login to view URL], etc.
When this calculation is done for all the pairs of texts, a table must be generated, indicating the number of texts we can extract with a maximum similarity ratio of x % with the value of x going from 0 to 100 by increments of 1. This table must be exportable in pdf, format A4.
After that, we want to extract the less similar texts out of the whole batch, with 2 options offered to user:
● Extract the x less similar texts (user will define the value of x)
● Extract all the text with a maximum similarity of n % (user will define the value of n)
The values of x and n will be different each time we'll use the tool.
This tool must be running on demand on a High Performance Computing service like Amazon's one, for example.
24 freelancers are bidding on average $471 for this job
Hello! I am a programming expert I have checked your project description. I can do it I guarantee the good result. I will wait for your reply. Thank you.
Hi, Sir. how are you ? I read your requirements carefully. I have very good skills in python & Algorithm. I am sure - If you hire me then you will get best result. Let us go together. Thanks.
Hi there, please leave a message on my chat so we can discuss the budget and deadline of the project. I have read your project description and i'm confident i can do this project for you perfectly. Thanks
Hello? Nice to meet you. I am excited to work with you on this project. I am ready to start work immediately. I have good skills in those. So I think I can help you if you want. Thank you. Best Regards.
My preferred method of freelancing is an interactive approach to project solving. I have an MSEE specializing in Digital Signal/Image/RF Processing. I do my work in MATLAB (expert). I also do Python programming.