I think you meant extract emails from document, right? I'd probably find `@` and iterate before and after unless I get posix punct, space, characters. But it'll probably give some false matches. So its really hard to find 100% emails from an arbitrary text. This is because valid email can contain many different type of characters. According to RFC 822 space is a valid character in email. So finding all the valid emails is tough. In a *trivial situation* an email would be separated by space. So find @ first. Then go back and front to find the first space. You'll get most common emails. Something like using this regex pattern [^[:space:]<@]+@[^[:space:]>]+ would suffice. But keep in mind, it'll work on trivial cases. Not on special cases. Regular expression can not be used on special cases. Here is full RFC-822 compliant email matching regular expression http://ex-parrot.com/~pdw/Mail-RFC822-Address.html
More information can be found on http://stackoverflow.com/questions/201323/using-a-regular-expression-to-validate-an-email-address On Sat, Jan 26, 2013 at 10:24 PM, Tedd Sperling <[email protected]> wrote: > Hi gang: > > I thought I had a function to strip emails from a document, but I can't > find it. > > So, before I start writing a common script, do any of you have a simple > script to do this? > > Here's an example of the problem: > > Before: > > "Will Alex" <[email protected]>;"Moita Zact" <[email protected]>;"Bob > Arms" <[email protected]>;"Meia Terms" <[email protected]>; > > After: > > [email protected] > [email protected] > [email protected] > [email protected] > > Cheers, > > tedd > > > _____________________ > [email protected] > http://sperling.com > > > -- > PHP General Mailing List (http://www.php.net/) > To unsubscribe, visit: http://www.php.net/unsub.php > > -- Shiplu.Mokadd.im ImgSign.com | A dynamic signature machine Innovation distinguishes between follower and leader

