highspeed text mapping functions written in PYTHON and Javascript (will award 2 people). Read b4 bidding
$250-750 USD
Betalades vid leverans
Important. String matching will not work for these functions as they datasets are 100K+ in size. The data for the mapping functions must be loaded into a binary tree so searching is fast. Do not bid if you can't do this. We have code in Delphi that does all these functions as a guide. However, is ok to start from scratch. Designing an efficient data structure is 1/3 of the challenge, traversing/searching it is another 1/3. ALL the functions make use of this initial structure. Get that down, and most of functions are the same, but with different data loaded.
Here are the mapping functions needed, all use the same function, just different data.
Translate first: Input( first name), output( translated first name). This is an exact case insensitive mapping, about 40,000 names. Mappings are saved in text file, simple lookup and string replacement.
Translate last: Input( last name), output(translated last name). This is an exact case insensitive mapping, about 40,000 names. Mappings are saved in text file, simple lookup and string replacement.
Translate title: Input( title), output(translated title). This is an exact case insensitive mapping, about 100,000 titles. Mappings are saved in text file, simple lookup and string replacement.
All translation data is in one file [login to view URL] (example file with comments attached)
***********************
Segment function:
Segment title: input( title) , output ( title level, title-level-number, title department, organization bracket).
-to get title-level: Search for title-levels as substring in provided title level. Exit search on first match
-to get title-dept: Search for title-dept as substring in provided title department. Exit search on first match
-to get title-level-number: use corresponding integer from title-level (in section header)
-to get organization-bracket: defined in [Org-Brackets] section in [login to view URL]
2 text files hold the data for segmenting the titles. [login to view URL], [login to view URL]
Reminder, there will be 100K+ entries
**********************
Function Normalize: Input(text), output(normalized text)
normalize instructions: there are 5 types commands. The first command is MAP. If a match is found, searching is stopped and the function is exited.
function: MAPRANGE: Maps ranges of values to a single value, example:
MapRange={1..24},{<25}
MapRange={25..99},{25 to 99}
MapRange={100..499},{100 to 499}
MapRange={500..999},{500 to 999}
MapRange={1000..},{1000+}
MapRange={},{Unavailable}
For the other 3, every command is processed if there is a match. (WS, SR, RE). They are listed 1 per line in the file
; WS = Word Swap, Swap the whole word for a mapped word
; Example
; replace "Chrm." with "Chairman"
; WS={Chairman},{Chrm.}
;RE = run regex on word, example format:
; example
; remove * \ . ? - SPACE from start of line (Spaces = \x20 \xA0 \xAD)
RE={(^[_\!\*\\\.\-\?\x20\xA0\xAD]+)}
;SR = search/replace
;example
;replace -> with -
SR={-},{->}
****
Be clear and put either the word JAVASCRIPT or PYTHON as the first word in your bid. To make sure you read this full posting, put your favorite color as the second word in your bid. ie "JAVASCRIPT BLUE".
Milestones will be
1. Binary data structure & Translation functions
2. Segment function
3. Normalize function SR, MAP
4. Normalize function, RE, WS
5. Normalize function MAPRANGE
The functions will be tested on a web page form that enters data in fields. Parsed data shown as output
We would like to spend max $750 on each language.
Good luck!
Projekt-id: #28497929
About the project
8 frilansare har lagt bud på i genomsnitt $624 för det här jobbet
PYTHON BLUE I'm a senior python expert, with 5+ years of experience. When do I have to finish this? I look forward to working on this project with you. Please contact me so we discuss details. Regards. Ekaterina
Hello, Hope this message finds you well, I checked your details and I believe that my experience is what you are looking 4. I have been working on similar projects for the past eight years, and I have the essential sk Mer
PYTHON RED For the inital data structures, I plan on using a red-black tree with lexicographic values of the strings as their keys. This will guarantee an O(log n) runtime to make all the other functions easier. I coul Mer
Hi, Donato. I have rich experience with PYTHON and Javascript development. I can help you perfectly. I just checked your project description carefully. So it is an ideal match for my skill and motivation for me. Please Mer