pyparsing 與 regex 結合表達漢字等Unicode字符（qbit） Detail - 正則表達式 qbit Blog

前言

技術棧

python      3.11.8
pyparsing   3.2.1
regex       2024.11.6

測試案例

測試代碼

# encoding: utf-8
# author: qbit
# date: 2025-02-05
# summary: 使用 pyparsing 和 sympy 化簡與或非邏輯表達式

import pyparsing as pp
import regex
# greet = pp.Word(pp.alphas) + "," + pp.Regex(r'\p{Han}+') + "!"
# 上面一句會報錯，因為 pyparsing 默認使用的內置 re 庫不支持 Unicode 正則
greet = pp.Word(pp.alphas) + "," + pp.Regex(regex.compile(r'\p{Han}+')) + "!"
hello = "Hello, 中國!"
print(hello, "->", greet.parseString(hello))

測試輸出

Hello, 中國! -> ['Hello', ',', '中國', '!']

qbit Blog

@q_bit

Tags

Python (289)

Linux (168)

redis (93)

Docker (89)

elasticsearch (48)

websocket (36)

日誌 (31)

shell (29)

fastapi (28)

ast (24)

async (23)

正則表達式 (21)

Stories

pyparsing 與 regex 結合表達漢字等Unicode字符（qbit） - Stories Detail

前言

測試案例

相關閲讀

Add a new Comments