regex
Ah, but if I do that in Java, it takes a lot longer. Basically the scene is this: I've got a web crawler, written in Java and running across multiple JVMs, looking for news, and it needs to recognise certain terms to dump in the DB. Because it's multithreaded, the algorithm needs to be as tight as possible or else it'll bottleneck.
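For the multithreaded case, one common way to keep the regex cost down in Java is to compile the pattern once and share it between threads, creating a fresh Matcher per call. The sketch below assumes the leading space is required in all four cases; the class and method names are made up for illustration and are not from the crawler itself.

```java
import java.util.regex.Pattern;

// Hypothetical helper; names are illustrative only.
final class TermMatcher {
    // Pattern is immutable and safe to share between threads,
    // so it is compiled once instead of on every call.
    private static final Pattern LENS = Pattern.compile(" Lens[ ,.']");

    // Matcher is NOT thread-safe, so each call creates its own.
    static boolean containsTerm(CharSequence text) {
        return LENS.matcher(text).find();
    }

    public static void main(String[] args) {
        System.out.println(containsTerm("A trip to Lens, France")); // true
        System.out.println(containsTerm("Lenses are expensive"));   // false
    }
}
```

Inside a character class the dot is already literal, so no \\. is needed there. Whether this is tight enough for the crawler would need measuring.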
Previously on "Regular Expressions"
-
Guest replied
Re: regex
Code:
    foreach $s (" Lens ", " Lens,", " Lens.", " Lens'", " Lens\\", "Lens", "Lens ", " Lens" ) {
        if ($s =~ / Lens[ ,\.']/) {
            print "--$s--\n";
        }
    }
Output:
    -- Lens --
    -- Lens,--
    -- Lens.--
    -- Lens'--
-
Guest replied
regex
Reynolds
Don't forget about start-of-line conditions, i.e.
blah blah blah.
Lens is a wonderful place.
blah blah
Of course, I forgot that Java itself strips \\ down to \ in a string literal, so you were right that \\. is needed to match a dot. Sorry, I haven't done any Java for a long time; I tend to use regex in awk.
There is a good reference: look at the tutorials at
www.javaregex.com, specifically tutorial 3.
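A rough Java sketch of both points, the start-of-line case and the \\. escaping. The class name is made up, and the pattern assumes that either a space or a line start may come before Lens, which goes a bit beyond what the original question asked for.

```java
import java.util.regex.Pattern;

// Hypothetical demo class; not from the original posts.
final class LineStartDemo {
    // (?m) lets ^ match at the start of every line, not only the whole input.
    // In Java source "\\." is the two-character regex \. (a literal dot).
    static final Pattern LENS =
            Pattern.compile("(?m)(?:^| )Lens(?: |,|\\.|')");

    static boolean found(String text) {
        return LENS.matcher(text).find();
    }

    public static void main(String[] args) {
        String text = "blah blah blah.\nLens is a wonderful place.\nblah blah";
        System.out.println(found(text));        // true
        System.out.println("\\.".length());     // 2
    }
}
```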
-
Guest replied
re
Cheers, whats. Perl, the space should be at the start of the 4th as well.
-
Guest replied
Re: regex
Can't be arsed to optimise, but the following Perl regex should work:
my $RegExp = qr/ Lens( |,|\.|')/;  # qr// keeps \. as a literal dot; in a double-quoted string it would collapse to a bare .
If it's performance critical (it did not sound like it), I'd do it in two stages:
my $RegExp = " Lens.";
'.' will match any character, and in case of a match I'd then check with a simple switch what the character is.
Oh yeah, Perl's question regarding the leading space in the 4th stands; I assumed the space is required in all cases.
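For the Java side, the two-stage idea might look something like this. It is only a sketch: the class name is invented, and whether it actually beats a single character-class pattern is untested.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical class; a sketch of the two-stage idea, not a benchmarked solution.
final class TwoStageMatch {
    // Stage 1: loose pattern, any single character after " Lens".
    // Note that '.' does not match a line terminator by default.
    private static final Pattern CANDIDATE = Pattern.compile(" Lens(.)");

    static boolean matches(String text) {
        Matcher m = CANDIDATE.matcher(text);
        while (m.find()) {
            // Stage 2: check the captured character with a plain switch.
            switch (m.group(1).charAt(0)) {
                case ' ':
                case ',':
                case '.':
                case '\'':
                    return true;
                default:
                    break; // e.g. " Lenses": keep scanning for the next candidate
            }
        }
        return false;
    }

    public static void main(String[] args) {
        System.out.println(matches("went to Lens."));            // true
        System.out.println(matches("new Lenses and Lens, too")); // true
        System.out.println(matches("new Lenses"));               // false
    }
}
```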
-
Guest replied
Re: regex
Are the leading spaces in the first three cases mandatory (and not in the fourth)?
-
Guest replied
regex
Off the top of my head, because I can't be bothered to check it:
you need java.util.regex and I think the pattern
"[ ]*Lens[ ,\.\']+"
I'd expect \. to find a full stop; with \\. I would expect to find a backslash followed by any character.
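As a quick check of that pattern in Java: a lone \. inside a Java string literal is an illegal escape and will not compile, so the sketch below writes it as \\. instead. The class name is made up. It also shows that [ ]* allows zero spaces, so Lens glued onto the end of another word would still match.

```java
import java.util.regex.Pattern;

// Hypothetical demo class for trying out the suggested pattern.
final class SuggestedPatternDemo {
    // "\\." in Java source becomes the regex \. (inside [...] a bare . also works).
    static final Pattern SUGGESTED = Pattern.compile("[ ]*Lens[ ,\\.']+");

    static boolean found(String s) {
        return SUGGESTED.matcher(s).find();
    }

    public static void main(String[] args) {
        System.out.println(found(" Lens,"));    // true
        System.out.println(found("Lens."));     // true: leading space is optional
        System.out.println(found("WideLens,")); // true: no word boundary is enforced
        System.out.println(found(" Lens"));     // false: needs a trailing delimiter
    }
}
```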
-
Guest started a topic: Regular Expressions
I'm looking for a way to match the city Lens in a variety of texts. It has to match " Lens ", " Lens,", " Lens." and "Lens'"
Does anyone know the best way of doing this with a Java/Perl regular expression?
Is this any good?
String j="Lens";
String pattern = " "+j+" | "+j+",| "+j+"\\.|"+j+"'";
And to find any word followed by a full stop, does the full stop have to be written like this: \\. ?
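A small way to try the pattern from the question in Java. This is a sketch only: the class and method names are invented, and Pattern.quote is an addition that guards against search words containing regex metacharacters. The "\\." in the source is what reaches the regex engine as \. and matches a literal full stop.

```java
import java.util.regex.Pattern;

// Hypothetical wrapper around the pattern from the question.
final class OriginalPatternCheck {
    static Pattern build(String word) {
        // Pattern.quote treats the word as literal text.
        String w = Pattern.quote(word);
        // Same four alternatives as in the question; note the 4th has no leading space.
        return Pattern.compile(" " + w + " | " + w + ",| " + w + "\\.|" + w + "'");
    }

    public static void main(String[] args) {
        Pattern p = build("Lens");
        System.out.println(p.matcher("in Lens.").find()); // true
        System.out.println(p.matcher("Lens'").find());    // true
        System.out.println(p.matcher("in Lensx").find()); // false
    }
}
```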