Language Checker App

Completed Posted Apr 28, 2011 Paid on delivery

I need an application that can do the following:

The input will be a .txt file.

The .txt file will contain thousands of URLs of pages from one specific website. The site has a lot of pages, and each page has a section where users can post comments. This comment section is what will be checked for language.

The application will go to each URL and check whether the majority of the content is in a specific language. If it is, the URL is saved to another .txt file. If the page's language doesn't match the required one, the URL is saved to a separate .txt file.
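The two-file routing described above could be sketched roughly as follows. This is only an illustration: the function names, the output file names matching.txt and other.txt, and the is_target_language callback are all assumptions, and fetching the pages is left out.

```python
from pathlib import Path
from typing import Callable, Iterable, List, Tuple


def split_by_language(
    pages: Iterable[Tuple[str, str]],
    is_target_language: Callable[[str], bool],
) -> Tuple[List[str], List[str]]:
    """Split (url, comment_text) pairs into matching and non-matching URLs."""
    matching: List[str] = []
    other: List[str] = []
    for url, text in pages:
        # The callback decides; any detector (word list, library) can be plugged in.
        (matching if is_target_language(text) else other).append(url)
    return matching, other


def write_url_lists(
    matching: List[str],
    other: List[str],
    match_path: str = "matching.txt",  # hypothetical file name
    other_path: str = "other.txt",     # hypothetical file name
) -> None:
    """Write one URL per line into the two result files."""
    Path(match_path).write_text("".join(u + "\n" for u in matching), encoding="utf-8")
    Path(other_path).write_text("".join(u + "\n" for u in other), encoding="utf-8")
```

Keeping the language check as a callback means the detection method can be swapped later without touching the file handling.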

It doesn't have to be 100% accurate.

Please send me a message describing how you plan to approach this project.

It's important that the application can process at least 1 URL per 10 seconds (faster is even better).

I'm not really sure how to solve this, so feel free to share ideas.

One idea: another .txt file holds 100 or more of the most common words of the chosen language (I can provide these), and the application checks whether a certain percentage of them appear in the web page.
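A minimal sketch of the word-list idea, under one possible reading: it measures what share of the words in the comment text are on the top-words list. The function names, the one-word-per-line file layout, and the 0.2 threshold are assumptions and would need tuning against real pages.

```python
import re
from typing import Set


def load_top_words(path: str) -> Set[str]:
    """Read one word per line from a .txt file (assumed layout), lowercased."""
    with open(path, encoding="utf-8") as handle:
        return {line.strip().lower() for line in handle if line.strip()}


def top_word_ratio(text: str, top_words: Set[str]) -> float:
    """Fraction of the words in `text` that appear in `top_words`."""
    # [^\W\d_]+ matches runs of letters, including accented ones.
    tokens = re.findall(r"[^\W\d_]+", text.lower())
    if not tokens:
        return 0.0
    hits = sum(1 for token in tokens if token in top_words)
    return hits / len(tokens)


def looks_like_language(text: str, top_words: Set[str], threshold: float = 0.2) -> bool:
    """True when enough of the text consists of the language's top words.

    The default threshold is a guess, not a measured value.
    """
    return top_word_ratio(text, top_words) >= threshold
```

Very short comment sections will give unreliable ratios, which fits the requirement that the result doesn't have to be 100% accurate.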

It's very important that the application only checks a specific region of the web page (the comment section), but that should be easy to solve.
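Restricting the check to the comment section might look like the sketch below, which uses Python's standard html.parser. It assumes the comments sit inside a single element with id="comments"; that id is hypothetical and has to be replaced with whatever the real site uses.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class CommentTextExtractor(HTMLParser):
    """Collects text found inside the element whose id equals container_id."""

    def __init__(self, container_id: str = "comments"):  # id is an assumption
        super().__init__()
        self.container_id = container_id
        self.depth = 0      # 0 means we are outside the comment container
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        if self.depth:
            self.depth += 1
        elif dict(attrs).get("id") == self.container_id:
            self.depth = 1

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if self.depth:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth and data.strip():
            self.chunks.append(data.strip())


def extract_comment_text(html: str, container_id: str = "comments") -> str:
    """Return the visible text of the comment container, joined by spaces."""
    parser = CommentTextExtractor(container_id)
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)
```

The depth counting relies on the page closing its tags properly; for messy real-world HTML a more forgiving parser would probably be the safer choice.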

I have another option that will make the job easier.

Currently I have a bot that can save the URL and the fetched text into an MS Access database.

Also, there are already language detection libraries available on the net.
Example: http://developer.cybozu.co.jp/oss/2010/10/language-detect.html

So basically all the program has to do is go through the database and add a check mark, or even a tag, to each URL.

Please adjust your bid for this option. In the long run it's better for me!

Attached is a sample MS Access database.

Your application must analyse the text in the "text" field and then put the language tag in the "Language" field.

You can use external, freely available language checker libraries, or even the Google Translate API.
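The database pass could be sketched like this. To keep the example runnable anywhere it uses Python's built-in sqlite3 as a stand-in for MS Access; a real Access file would need an ODBC connection instead. The table name pages is hypothetical, only the "text" and "Language" field names come from the description above, and detect is a placeholder for whichever language checker gets chosen.

```python
import sqlite3
from typing import Callable


def tag_languages(
    conn: sqlite3.Connection,
    detect: Callable[[str], str],
    table: str = "pages",  # hypothetical table name
) -> int:
    """Fill the "Language" field for every row that has no tag yet.

    Returns the number of rows that were tagged.
    """
    rows = conn.execute(
        f'SELECT rowid, "text" FROM {table} '
        f'WHERE "Language" IS NULL OR "Language" = \'\''
    ).fetchall()
    for rowid, text in rows:
        # detect() is whatever language checker is plugged in.
        conn.execute(
            f'UPDATE {table} SET "Language" = ? WHERE rowid = ?',
            (detect(text or ""), rowid),
        )
    conn.commit()
    return len(rows)
```

Because only untagged rows are selected, the program can be stopped and rerun without redoing rows that already have a tag.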

.NET C++ Programming Java

Project ID: #1041047

About the project

15 proposals Remote project Active Apr 29, 2011

Awarded to:

pureMJ

please see the PM

$75 USD in 3 days
(9 Reviews)
3.3

15 freelancers are bidding on average $187 for this job

srinichal

I can deliver the code in .net with C# asap

$220 USD in 4 days
(107 Reviews)
7.1
barundebnath

Please check your inbox.

$400 USD in 20 days
(88 Reviews)
6.5
ineedWorkJob

sir plz check pm !

$220 USD in 3 days
(124 Reviews)
6.5
RedOctober

Hello, basilgood! See your PMB please.

$240 USD in 5 days
(5 Reviews)
5.8
Murzka

Can be done with Java!

$160 USD in 8 days
(27 Reviews)
6.2
crypted

Please check your pmb.

$300 USD in 3 days
(29 Reviews)
5.1
omniaibrahim

Dear Sir, I am an experienced J2SE/J2EE developer with extensive knowledge in Swing, Applets, servlets, JSP, Struts, JSF, Hibernate, EJB, Spring, ADF and web services. I also have great experience in JavaScript, Ajax, ...

$100 USD in 5 days
(17 Reviews)
4.6
donhuan

Dear Hiring Manager, I'm very interested in your job post involving these skills. I have created some projects which use HTTP requests to get information from the web and can use the Google language detection API. I believe m...

$50 USD in 3 days
(9 Reviews)
3.3
JohnBHarris

I have stacks of experience developing applications in C/C++/C#. Recently I have been using the .NET WebClient class to download web pages and analyse their contents. I think this would be a good approach and is certai...

$200 USD in 5 days
(1 Review)
2.4
sm2mafaz

Hello There, Please have a look to the PMB for further details. Kind Regards, Mafaz

$180 USD in 10 days
(2 Reviews)
1.9
stdcall

I would be glad to make this program.

$249.99 USD in 5 days
(0 Reviews)
0.0
kalpesh080888

Please check u r pmb.

$250 USD in 5 days
(0 Reviews)
0.0
jailton

Please, check your PMB.

$100 USD in 5 days
(0 Reviews)
0.0
AliShahid83

I have recently worked on such a project. Hope we can have a deal. Thanks.

$60 USD in 3 days
(0 Reviews)
0.0