(Python)拆分字符串多個分隔符更有效?1) 使用多重替換方法然後使用拆分 2) 使用正則表達式 ((Python) which is more efficient to split a string multiple separators? 1) Using multiple replace method then using split 2) using regular Expressions)


問題描述

(Python)拆分字符串多個分隔符更有效?1) 使用多重替換方法然後使用拆分 2) 使用正則表達式 ((Python) which is more efficient to split a string multiple separators? 1) Using multiple replace method then using split 2) using regular Expressions)

示例:

(案例1)

#先用replace方法將不同類型的分隔符替換為單一類型,然後再用split方法

p>
text = "python is, an easy;language; to, learn."
text_one_delimiter = text.replace("# ", ", ").replace("% ", ", ").replace("; ", ", ").replace("‑ ", ", ")

print(text_one_delimiter.split(", "))

(case 2)

#使用正則表達式進行分割使用多個分隔符

import re

text = "python is# an% easy;language‑ to, learn."
print(re.split('; |, |# |% |‑ ', text))

參考解法

方法 1:

timeit module is useful for speed comparison of code snippet. It might be used following way:

import timeit
case1 = '''text = "python is, an easy;language; to, learn."
text_one_delimiter = text.replace("# ", ", ").replace("% ", ", ").replace("; ", ", ").replace("‑ ", ", ")
text_one_delimiter.split(", ")'''
case2_setup = "import re"
case2 = '''text = "python is# an% easy;language‑ to, learn."
re.split('; |, |# |% |‑ ', text)'''
print(timeit.timeit(case1))
print(timeit.timeit(case2,case2_setup))

Output (will depend on your machine):

1.1250261999999793
2.2901268999999616

Note that I excluded prints from examined code and make import re setup, as otherwise it would import it without need several time. Conclusion is that in this particular case method with multiple .replaces is faster than re.split.

(tested in Python 3.7.3)

(by Shambhav AgrawalDaweo)

參考文件

  1. (Python) which is more efficient to split a string multiple separators? 1) Using multiple replace method then using split 2) using regular Expressions (CC BY‑SA 2.5/3.0/4.0)

#split #Python #replace #python-re






相關問題

將 xml 元素內容拆分為固定行數 (Split xml element content into fix number of lines)

是否有任何標准說明“aba”.split(/a/) 是否應該返回 1,2 或 3 個元素? (Is there any standard which says if "aba".split(/a/) should return 1,2, or 3 elements?)

Cố gắng gọi các phương thức trong phương thức main với biến được khởi tạo trong các phương thức khác (Trying to call methods in main method with variable initialized in other methods)

使用 Java-Regex 與 Regex 成對拆分多行文本 (Split text with Java-Regex in pairs with Regex over several lines)

如何分割字節數組 (How to split a byte array)

String componentsSeparatedByString 做一次 (String componentsSeparatedByString do one time)

從一行文本中獲取特定數據 (Get specific data from a line of text)

(Python)拆分字符串多個分隔符更有效?1) 使用多重替換方法然後使用拆分 2) 使用正則表達式 ((Python) which is more efficient to split a string multiple separators? 1) Using multiple replace method then using split 2) using regular Expressions)

ValueError:發現樣本數量不一致的輸入變量:[2935848、2935849] (ValueError: Found input variables with inconsistent numbers of samples: [2935848, 2935849])

在 Powershell 中拆分和添加字符串 (Splitting and Adding String in Powershell)

在 python 函數中檢查月份的有效性時出錯 (Error in checking validity of month in python function)

如何將 .obj 文件拆分為其他兩個文件(python、open3d)? (How to split a .obj file into two other files (python, open3d)?)







留言討論