Selecting the right ReadP parse result

I am trying to parse an RFC 5322 email address. My parser works in the sense that one of the results it returns is correct. How can I select the "right" result?
Given the string

    Foo Bar <foo@bar.com>

my parser should produce the value

    Address (Just "Foo Bar") "foo@bar.com"

Alternatively, given the string foo@bar.com, the parser should produce the value

    Address Nothing "foo@bar.com"

When both are possible, the value that includes the name is preferred.
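The Address type itself is not shown in this question. A minimal sketch of a definition consistent with the record syntax in the parser output further down might look like the following; the field types are an assumption, and the sample address foo@bar.com is taken from the fragments visible in that output.

```haskell
import qualified Data.Text as T

-- Assumed definition: field names come from the printed results,
-- field types are inferred from the use of T.pack in the parser.
data Address = Address
  { addressName  :: Maybe T.Text
  , addressEmail :: T.Text
  } deriving (Eq, Show)

-- The value wanted for an input with a display name.
named :: Address
named = Address (Just (T.pack "Foo Bar")) (T.pack "foo@bar.com")

-- The value wanted for an input that is only an address.
bare :: Address
bare = Address Nothing (T.pack "foo@bar.com")
```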
My parser looks like this:

    import Control.Applicative
    import Data.Char
    import qualified Data.Text as T
    import Text.ParserCombinators.ReadP

    -- Parses a bare address with no display name.
    onlyEmail :: ReadP Address
    onlyEmail = do
      skipSpaces
      email <- many1 $ satisfy isAscii
      skipSpaces
      return $ Address Nothing (T.pack email)

    -- Parses a display name followed by an address in angle brackets.
    withName :: ReadP Address
    withName = do
      skipSpaces
      name <- many1 (satisfy isAscii)
      skipSpaces
      email <- between (char '<') (char '>') (many1 $ satisfy isAscii)
      skipSpaces
      return $ Address (Just $ T.pack name) (T.pack email)

    -- For ReadP, (<|>) is symmetric choice: results of both branches are kept.
    rfc5322 :: ReadP Address
    rfc5322 = withName <|> onlyEmail
When I run the parser with

    readP_to_S rfc5322 "Foo Bar <foo@bar.com>"

it produces the following results:
    [ (Address {addressName = Nothing, addressEmail = "F"},"oo Bar <foo@bar.com>")
    , (Address {addressName = Nothing, addressEmail = "Fo"},"o Bar <foo@bar.com>")
    , (Address {addressName = Nothing, addressEmail = "Foo"},"Bar <foo@bar.com>")
    , (Address {addressName = Nothing, addressEmail = "Foo "},"Bar <foo@bar.com>")
    , (Address {addressName = Nothing, addressEmail = "Foo B"},"ar <foo@bar.com>")
    , (Address {addressName = Nothing, addressEmail = "Foo Ba"},"r <foo@bar.com>")
    , (Address {addressName = Nothing, addressEmail = "Foo Bar"},"<foo@bar.com>")
    , (Address {addressName = Nothing, addressEmail = "Foo Bar "},"<foo@bar.com>")
    , (Address {addressName = Nothing, addressEmail = "Foo Bar <"},"foo@bar.com>")
    , (Address {addressName = Nothing, addressEmail = "Foo Bar <f"},"oo@bar.com>")
    , (Address {addressName = Nothing, addressEmail = "Foo Bar <fo"},"o@bar.com>")
    , (Address {addressName = Nothing, addressEmail = "Foo Bar <foo"},"@bar.com>")
    , (Address {addressName = Nothing, addressEmail = "Foo Bar <foo@"},"bar.com>")
    , (Address {addressName = Nothing, addressEmail = "Foo Bar <foo@b"},"ar.com>")
    , (Address {addressName = Nothing, addressEmail = "Foo Bar <foo@ba"},"r.com>")
    , (Address {addressName = Nothing, addressEmail = "Foo Bar <foo@bar"},".com>")
    , (Address {addressName = Nothing, addressEmail = "Foo Bar <foo@bar."},"com>")
    , (Address {addressName = Nothing, addressEmail = "Foo Bar <foo@bar.c"},"om>")
    , (Address {addressName = Nothing, addressEmail = "Foo Bar <foo@bar.co"},"m>")
    , (Address {addressName = Nothing, addressEmail = "Foo Bar <foo@bar.com"},">")
    , (Address {addressName = Just "Foo Bar", addressEmail = "foo@bar.com"},"")
    , (Address {addressName = Just "Foo Bar ", addressEmail = "foo@bar.com"},"")
    , (Address {addressName = Nothing, addressEmail = "Foo Bar <foo@bar.com>"},"")
    ]
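Only the last three entries have an empty remainder; every other entry is a partial parse that stopped before the end of the input. As a small sketch of separating the two, the hypothetical helper completeParses below keeps only the results whose remaining input is empty. The Address definition is assumed, and sample is a hand-copied excerpt of the list above, not real parser output.

```haskell
import qualified Data.Text as T

-- Assumed definition, inferred from the printed results.
data Address = Address
  { addressName  :: Maybe T.Text
  , addressEmail :: T.Text
  } deriving (Eq, Show)

-- Hypothetical helper: keep only results that consumed the whole input.
-- Entries whose remainder is not "" fail the pattern and are dropped.
completeParses :: [(a, String)] -> [a]
completeParses results = [value | (value, "") <- results]

-- Hand-copied excerpt of the result list (one partial parse, three complete).
sample :: [(Address, String)]
sample =
  [ (Address Nothing (T.pack "Foo Bar"), "<foo@bar.com>")
  , (Address (Just (T.pack "Foo Bar")) (T.pack "foo@bar.com"), "")
  , (Address (Just (T.pack "Foo Bar ")) (T.pack "foo@bar.com"), "")
  , (Address Nothing (T.pack "Foo Bar <foo@bar.com>"), "")
  ]
```

Filtering alone still leaves three candidates here, so it narrows the list but does not by itself express a preference for the named form.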
In this case, the result I want actually appears third from last in the list. How do I express this preference?
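One direction that might work, sketched under assumptions rather than offered as a complete RFC 5322 parser: ReadP provides a left-biased choice operator (<++) that only tries its right argument when the left one yields no result, an eof parser that succeeds only at the end of the input, and the greedy munch1. The variant below swaps these in for (<|>) and many1. The Address definition, the helper names plainChar and parseAddress, the exclusion of angle brackets from names, and the trimming of trailing spaces are all assumptions made for this illustration.

```haskell
import Data.Char (isAscii, isSpace)
import Data.List (dropWhileEnd)
import qualified Data.Text as T
import Text.ParserCombinators.ReadP

-- Assumed definition, inferred from the printed results.
data Address = Address
  { addressName  :: Maybe T.Text
  , addressEmail :: T.Text
  } deriving (Eq, Show)

-- Simplification for this sketch: ASCII characters other than angle brackets.
plainChar :: Char -> Bool
plainChar c = isAscii c && c /= '<' && c /= '>'

-- Bare address: one run of non-space characters, consumed greedily.
onlyEmail :: ReadP Address
onlyEmail = do
  skipSpaces
  email <- munch1 (\c -> plainChar c && not (isSpace c))
  skipSpaces
  return $ Address Nothing (T.pack email)

-- Display name up to '<', then the address between angle brackets.
withName :: ReadP Address
withName = do
  skipSpaces
  name <- munch1 plainChar
  email <- between (char '<') (char '>') (munch1 plainChar)
  skipSpaces
  -- munch1 also consumed the spaces before '<', so trim them from the name.
  return $ Address (Just . T.pack $ dropWhileEnd isSpace name) (T.pack email)

-- Prefer withName; fall back to onlyEmail only if withName has no result.
-- eof rejects any parse that leaves input unconsumed.
rfc5322 :: ReadP Address
rfc5322 = (withName <++ onlyEmail) <* eof

-- Hypothetical wrapper: succeed only on a single complete parse.
parseAddress :: String -> Maybe Address
parseAddress s = case readP_to_S rfc5322 s of
  [(addr, "")] -> Just addr
  _            -> Nothing
```

With these changes, parseAddress "Foo Bar <foo@bar.com>" should give the named value and parseAddress "foo@bar.com" the one with Nothing. The trade-off is that munch1 never backtracks, so this only works because the name and the address are delimited by characters that neither may contain.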