| Copyright | (c) Chris Kuklewicz 2006 | 
|---|---|
| License | BSD-style (see the file LICENSE) | 
| Maintainer | libraries@haskell.org, textregexlazy@personal.mightyreason.com | 
| Stability | experimental | 
| Portability | non-portable (regex-base needs MPTC+FD) | 
| Safe Haskell | None | 
| Language | Haskell98 | 
Text.Regex.Posix.ByteString
Contents
Description
This provides ByteString instances for RegexMaker and RegexLike
 based on Text.Regex.Posix.Wrap, and a (RegexContext Regex
 ByteString ByteString) instance.
To use these instance, you would normally import Text.Regex.Posix. You only need to import this module to use the medium level API of the compile, regexec, and execute functions. All of these report error by returning Left values instead of undefined or error or fail.
The ByteString will only be passed to the library efficiently (as a pointer) if it ends in a NUL byte. Otherwise a temporary copy must be made with the 0 byte appended.
- data Regex
- type MatchOffset = Int
- type MatchLength = Int
- data ReturnCode
- type WrapError = (ReturnCode, String)
- unusedOffset :: Int
- compile :: CompOption -> ExecOption -> ByteString -> IO (Either WrapError Regex)
- execute :: Regex -> ByteString -> IO (Either WrapError (Maybe (Array Int (MatchOffset, MatchLength))))
- regexec :: Regex -> ByteString -> IO (Either WrapError (Maybe (ByteString, ByteString, ByteString, [ByteString])))
- newtype CompOption = CompOption CInt
- compBlank :: CompOption
- compExtended :: CompOption
- compIgnoreCase :: CompOption
- compNoSub :: CompOption
- compNewline :: CompOption
- newtype ExecOption = ExecOption CInt
- execBlank :: ExecOption
- execNotBOL :: ExecOption
- execNotEOL :: ExecOption
Types
type MatchOffset = Int #
0 based index from start of source, or (-1) for unused
type MatchLength = Int #
non-negative length of a match
data ReturnCode #
ReturnCode is an enumerated CInt, corresponding to the error codes
 from man 3 regex:
- retBadbr(- REG_BADBR) invalid repetition count(s) in- { }
- retBadpat(- REG_BADPAT) invalid regular expression
- retBadrpt(- REG_BADRPT)- ?,- *, or- +operand invalid
- retEcollate(- REG_ECOLLATE) invalid collating element
- retEctype(- REG_ECTYPE) invalid character class
- retEescape(- REG_EESCAPE)- \applied to unescapable character
- retEsubreg(- REG_ESUBREG) invalid backreference number
- retEbrack(- REG_EBRACK) brackets- [ ]not balanced
- retEparen(- REG_EPAREN) parentheses- ( )not balanced
- retEbrace(- REG_EBRACE) braces- { }not balanced
- retErange(- REG_ERANGE) invalid character range in- [ ]
- retEspace(- REG_ESPACE) ran out of memory
- retNoMatch(- REG_NOMATCH) The regexec() function failed to match
Instances
type WrapError = (ReturnCode, String) #
The return code will be retOk when it is the Haskell wrapper and not the underlying library generating the error message.
Miscellaneous
unusedOffset :: Int #
Medium level API functions
Arguments
| :: CompOption | Flags (summed together) | 
| -> ExecOption | Flags (summed together) | 
| -> ByteString | The regular expression to compile | 
| -> IO (Either WrapError Regex) | Returns: the compiled regular expression | 
Compiles a regular expression
Arguments
| :: Regex | Compiled regular expression | 
| -> ByteString | String to match against | 
| -> IO (Either WrapError (Maybe (Array Int (MatchOffset, MatchLength)))) | Returns:  | 
Matches a regular expression against a buffer, returning the buffer indicies of the match, and any submatches
| Matches a regular expression against a string
Arguments
| :: Regex | Compiled regular expression | 
| -> ByteString | String to match against | 
| -> IO (Either WrapError (Maybe (ByteString, ByteString, ByteString, [ByteString]))) | 
Compilation options
newtype CompOption #
A bitmapped CInt containing options for compilation of regular
 expressions.  Option values (and their man 3 regcomp names) are
- compBlankwhich is a completely zero value for all the flags. This is also the- blankCompOptvalue.
- compExtended(REG_EXTENDED) which can be set to use extended instead of basic regular expressions. This is set in the- defaultCompOptvalue.
- compNewline(REG_NEWLINE) turns on newline sensitivity: The dot (.) and inverted set- [^ ]never match newline, and ^ and $ anchors do match after and before newlines. This is set in the- defaultCompOptvalue.
- compIgnoreCase(REG_ICASE) which can be set to match ignoring upper and lower distinctions.
- compNoSub(REG_NOSUB) which turns off all information from matching except whether a match exists.
Constructors
| CompOption CInt | 
Instances
compBlank :: CompOption #
A completely zero value for all the flags.
 This is also the blankCompOpt value.
compNoSub :: CompOption #
Execution options
newtype ExecOption #
A bitmapped CInt containing options for execution of compiled
 regular expressions.  Option values (and their man 3 regexec names) are
- execBlankwhich is a complete zero value for all the flags. This is the blankExecOpt value.
- execNotBOL(REG_NOTBOL) can be set to prevent ^ from matching at the start of the input.
- execNotEOL(REG_NOTEOL) can be set to prevent $ from matching at the end of the input (before the terminating NUL).
Constructors
| ExecOption CInt | 
Instances
execBlank :: ExecOption #
A completely zero value for all the flags.
 This is also the blankExecOpt value.