Collect proper names daily on a French news website and create a database with their frequency
$30-80 USD
Completed
Posted almost 13 years ago
Paid on delivery
<[login to view URL]> is a French reference newspaper.
I want software that "reads" about 30 free articles every day and collects the proper names they contain (Abidjan, Fukushima...).
The names are then written to a database and sorted by frequency.
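For bidders, the storage step could look roughly like the sketch below. It is only an illustration under assumptions: SQLite as the database, and the table name `proper_names` and the function names are hypothetical, not part of the requirement.

```python
import sqlite3
from collections import Counter


def record_names(conn, names):
    """Add the names found in one run to the cumulative frequency table."""
    # Hypothetical schema: one row per proper name with a running count.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS proper_names ("
        "name TEXT PRIMARY KEY, frequency INTEGER NOT NULL)"
    )
    for name, count in Counter(names).items():
        # Create the row on first sight, then add this run's count to it.
        conn.execute(
            "INSERT OR IGNORE INTO proper_names (name, frequency) VALUES (?, 0)",
            (name,),
        )
        conn.execute(
            "UPDATE proper_names SET frequency = frequency + ? WHERE name = ?",
            (count, name),
        )
    conn.commit()


def names_by_frequency(conn):
    """Return (name, frequency) rows, most frequent first."""
    return conn.execute(
        "SELECT name, frequency FROM proper_names "
        "ORDER BY frequency DESC, name"
    ).fetchall()
```

Calling `record_names` once per daily run accumulates the counts over time; `names_by_frequency` gives the sorted view.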
Details:
- Articles are found in the menus Actualités and Débats. Each thumbnail of each sub-category must be checked. For example, for Actualités\International: International, Afrique, Europe, Amériques, Asie-Pacifique, Proche-Orient. (Not all categories have thumbnails. Some thumbnails have new articles daily, but most have them only every second or third day.)
- The program has to recognize proper names, possibly by checking words against a database of French common nouns. A proper name starts with a capital letter, but not every capitalized word is a proper name. In the following example, "Il" is not a proper name, while "Churchill" and "Londres" are: "Il a rencontré Churchill dans une petite ville. Londres était trop encombré."
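
The recognition rule described above could be sketched as follows. This is a minimal heuristic, not a required design: a word counts as a proper name if it is capitalized and its lowercase form is absent from a lexicon of common words. The `COMMON_WORDS` set here is a tiny hypothetical stand-in for a real French word list.

```python
import re

# Hypothetical stand-in for a real French lexicon (assumption); a real run
# would load a full list of common words from a file or database.
COMMON_WORDS = {
    "il", "a", "rencontré", "dans", "une", "petite", "ville",
    "était", "trop", "encombré",
}

# A word is a run of letters (accented letters included), optionally
# joined by hyphens, as in "Proche-Orient".
WORD_RE = re.compile(r"[^\W\d_]+(?:-[^\W\d_]+)*")


def extract_proper_names(text, common_words=COMMON_WORDS):
    """Return capitalized words whose lowercase form is not a common word."""
    names = []
    for match in WORD_RE.finditer(text):
        word = match.group()
        if word[0].isupper() and word.lower() not in common_words:
            names.append(word)
    return names


sample = "Il a rencontré Churchill dans une petite ville. Londres était trop encombré."
print(extract_proper_names(sample))  # ['Churchill', 'Londres']
```

Because every capitalized word is checked against the lexicon, a sentence-initial "Il" is rejected while a sentence-initial "Londres" is kept. Names that are also common words would be missed by this rule.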
## Deliverables
* * *

This broadcast message was sent to all bidders on Saturday Apr 30, 2011 1:57:41 AM:
Dear bidders, the best offer from a well-rated coder is currently $180. If you haven't given your price or want to change it, please do so within 72 hours. Finally, it is not necessary to take all articles into account: the RSS feeds listed on page [login to view URL],48-0,1-0,[login to view URL] will be good enough. Regards.
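
If the RSS feeds are used as the article source, collecting the article links from one feed might look like the sketch below. It assumes standard RSS 2.0 markup (`<item>` elements containing a `<link>`); the sample feed and its example.com URLs are invented for illustration and do not reflect the newspaper's actual feeds.

```python
import xml.etree.ElementTree as ET


def article_links(rss_xml):
    """Return the <link> text of every <item> in an RSS 2.0 document."""
    root = ET.fromstring(rss_xml)
    links = []
    for item in root.iter("item"):
        link = item.findtext("link")
        if link:
            links.append(link.strip())
    return links


# Invented sample feed (assumption); a real run would download the feed text.
SAMPLE_FEED = """<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0">
  <channel>
    <title>International</title>
    <item><title>Article 1</title><link>https://example.com/article-1</link></item>
    <item><title>Article 2</title><link>https://example.com/article-2</link></item>
  </channel>
</rss>"""

print(article_links(SAMPLE_FEED))
```

Each returned link would then be fetched and its text passed to the proper-name extraction step.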