regular expressions library
스레드 게시자: Luna Media
Luna Media
Luna Media
미국
영어에서 스페인어
+ ...
Jun 15, 2024

My colleagues and I are trying to build a library of regular expressions to help us fix machine translation errors automatically. Before we get too deep into learning and building it, we were wondering if such a thing already exists out there. Google searches did not help locate a "library regular expressions for localization" and it would be a bit time-consuming to find examples throughout the forum, which I'm sure there are plenty. Thank you very much!

 
Cilian O'Tuama
Cilian O'Tuama  Identity Verified
독일
Local time: 09:39
독일어에서 영어
+ ...
Machine versus machine? Jun 16, 2024

Maybe I've misunderstood, but it sounds like you want to improve MT by using another machine?

Automatically replace certain terms with others? Would that improve anything?

More details please.

A list of 'regular expressions" would be endless.


 
Dan Lucas
Dan Lucas  Identity Verified
영국
Local time: 08:39
회원(2014)
일본어에서 영어
Handcraft them Jun 16, 2024

Luna Media wrote:
My colleagues and I are trying to build a library of regular expressions to help us fix machine translation errors automatically.

Personally, I don't think a library or any other kind of prebuilt collection will be of much use to you, because you will have to inspect each case individually. It will depend very much on the language pair, the content of the text, and so on.

Regexes are powerful tools, but there are lots of edge cases and they can be destructive if deployed carelessly or indiscriminately. I use them fairly frequently and can testify that is very easy to make mistakes that ruin large sections of your document. This potential for disaster may be the origin of the following well-known observation by Jamie Zawinski:

Some people, when confronted with a problem, think "I know, I'll use regular expressions." Now they have two problems.
If you really want to do this, I would abandon the idea of a pre-built library and just start crafting one example at a time, and testing very carefully (in a sandbox) as you go. That way you will at least understand what you are doing. If you're new to regexes and I suggest looking at a tool like RegexBuddy.

Regards,
Dan


Jennifer Levey
Stepan Konev
CafeTran Trainer
Darius Sciuka
Alison Jenner
 
Luna Media
Luna Media
미국
영어에서 스페인어
+ ...
주제 스타터
Thank you Jun 16, 2024

Thank you for all the suggestions and input.

 


이 포럼에 구체적으로 배정된 관리자가 없습니다.
사이트 규칙 위반을 신고하거나 도움을 받으시려면 사이트 스태프 »에게 문의


regular expressions library







TM-Town
Manage your TMs and Terms ... and boost your translation business

Are you ready for something fresh in the industry? TM-Town is a unique new site for you -- the freelance translator -- to store, manage and share translation memories (TMs) and glossaries...and potentially meet new clients on the basis of your prior work.

More info »
Protemos translation business management system
Create your account in minutes, and start working! 3-month trial for agencies, and free for freelancers!

The system lets you keep client/vendor database, with contacts and rates, manage projects and assign jobs to vendors, issue invoices, track payments, store and manage project files, generate business reports on turnover profit per client/manager etc.

More info »