Need to develop data analysis tool for 4-5 files having each 30 lac records having 100 columns and analytical logic to be defined to generate exceptional record reports.
Customer log in & ticket generation, company website to be created
Need to identify the probable duplicate records based on same/similar names, based on combination of 2-5 cells data for each records and validation of some data for each record from exiting masters
Data analysis can be in MySQL or oracle, need to know the cost of oracle license & maintenance charges, if opting for oracle for data analysis
File upload,data Validation/Analysis programme & website
5-7 type of files will be uploded in Excel/CSV/Text file/excel binary. These may be uploaded by customer or may be emailed by customer and then may be uploded by our office team or agent.
Dynamic & responsive Web site will have 18-20 pages containing home page, about us, contact us, services etc
Some master with active/inactive and unique value checks for each record files will also be managed by us which will be used validations like bank names, IFSC codes, district, states, Pin codes etc
Output report in various formats to be generated for various exceptions based on selection
Customer data must be secured and log file for view, edit, download, access etc to be maintained, each customer data must be accessed by customer or admin only. Each request will be allotted the ticket number and status of the same can be maintained and auto mailers for the same can be configured
input & output for each version/ticket separate with auto generated password & auto mail the same to customer for accessing the same.
Different PAN/Customer files validation request not to be accepted from same IP address, these needs to be tracked, to track the multiple companies data files getting validated with same customer profile
Normalisation of the data must be done before validations like removal of space, special characters etc from particular columns, removal of prefix from name like Mr, Approx 50 major validation to be done based on single cell data/column/combination of multiple cell data from single/multiple files like probable/same/similar duplicate name etc
Programming & designing should be such that certification may be possible at later stage for customer confidence.
Speed of upload & processing will be critical considering the volume and should not be error out/hang up, performace should be good.
Security Module to be implement for data security & log file, tracking of the IP address for each access to data, date time, log in details
Phase -2
data extraction from Scan copies in PDF/JPEG etc formats which will be used in data validation
Payment gateway for payments by customers
API with NSDL/others for data validation etc.
Data extraction & validation from PF,ESI, VAT,ROC etc. govt websites
Graphic Design, HTML, MySQL, PHP, Website Design
Let's discuss over freelancer Personal Message Box for the proper estimation of cost and time.
I am myself developer so you will directly work with me. No mediators. No managers. No subcontractors.
see my recent work for the technical expertise along with reviews & feedback on my profile page.
Your project is unique and would like to discuss the whole process for better understanding. Can you please ping me on chat?
I have scrapped Facebook, Twittter, Instagram, Amazon, Trip Adviser and few more e-commerce website.
I am sure that I can help you out to fix this.
Please ping me on chat to discuss the same.
Thanking you
Umesh