如何解决仅使用 BeautfulSoup
我有一个很大的讲义 html 文件,我想通过各种定义、定理等来拆分它。我已经设法做到了这一点,但是当我使用 .get_text() 函数时,它同时获得了 unicode和 LaTeX 代码,有没有(优雅的)分割这些的方式?
示例:
定义的原始 HTML:
'''
<div class="ltx_theorem ltx_theorem_definition" id="Ch1.S1.Thmtheorem1">
<h6 class="ltx_title ltx_runin ltx_title_theorem">
<span class="ltx_tag ltx_tag_theorem"><span class="ltx_text ltx_font_bold">Definition 1.1.1</span></span> (Groups: First definition).</h6>
<div class="ltx_para" id="Ch1.S1.Thmtheorem1.p1">
<p class="ltx_p">A <span class="ltx_text ltx_font_bold">group</span> <mjx-container class="MathJax CtxtMenu_Attached_0" ctxtmenu_counter="8" jax="CHTML" role="presentation" style="font-size: 101.1%; position: relative;" tabindex="0"><mjx-math aria-hidden="true" class="ltx_Math MJX-TEX" id="Ch1.S1.Thmtheorem1.p1.m1"><mjx-semantics><mjx-mrow><mjx-mo class="mjx-n"><mjx-c class="mjx-c28"></mjx-c></mjx-mo><mjx-mi class="mjx-i"><mjx-c class="mjx-c1D43A TEX-I"></mjx-c></mjx-mi><mjx-mo class="mjx-n"><mjx-c class="mjx-c2C"></mjx-c></mjx-mo><mjx-mo class="mjx-n" space="2"><mjx-c class="mjx-c2218"></mjx-c></mjx-mo><mjx-mo class="mjx-n"><mjx-c class="mjx-c29"></mjx-c></mjx-mo></mjx-mrow></mjx-semantics></mjx-math><mjx-assistive-mml display="inline" role="presentation" unselectable="on"><math alttext="(G,\circ)" class="ltx_Math" display="inline" xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mrow><mo stretchy="false">(</mo><mi>G</mi><mo>,</mo><mo>∘</mo><mo stretchy="false">)</mo></mrow><annotation encoding="application/x-tex">(G,\circ)</annotation></semantics></math></mjx-assistive-mml></mjx-container> is a non-empty set <mjx-container class="MathJax CtxtMenu_Attached_0" ctxtmenu_counter="9" jax="CHTML" role="presentation" style="font-size: 101.1%; position: relative;" tabindex="0"><mjx-math aria-hidden="true" class="ltx_Math MJX-TEX" id="Ch1.S1.Thmtheorem1.p1.m2"><mjx-semantics><mjx-mi class="mjx-i"><mjx-c class="mjx-c1D43A TEX-I"></mjx-c></mjx-mi></mjx-semantics></mjx-math><mjx-assistive-mml display="inline" role="presentation" unselectable="on"><math alttext="G" class="ltx_Math" display="inline" xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mi>G</mi><annotation encoding="application/x-tex">G</annotation></semantics></math></mjx-assistive-mml></mjx-container> together with a binary operation <mjx-container class="MathJax CtxtMenu_Attached_0" ctxtmenu_counter="10" jax="CHTML" role="presentation" style="font-size: 101.1%; position: relative;" tabindex="0"><mjx-math aria-hidden="true" class="ltx_Math MJX-TEX" id="Ch1.S1.Thmtheorem1.p1.m3"><mjx-semantics><mjx-mo class="mjx-n"><mjx-c class="mjx-c2218"></mjx-c></mjx-mo></mjx-semantics></mjx-math><mjx-assistive-mml display="inline" role="presentation" unselectable="on"><math alttext="\circ" class="ltx_Math" display="inline" xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mo>∘</mo><annotation encoding="application/x-tex">\circ</annotation></semantics></math></mjx-assistive-mml></mjx-container> – called the <span class="ltx_text ltx_font_bold">“group law”</span> – satisfying:</p>
<ol class="ltx_enumerate" id="Ch1.S1.I1">
<li class="ltx_item" id="Ch1.S1.I1.i1">
<div class="ltx_para" id="Ch1.S1.I1.i1.p1">
<p class="ltx_p"><span class="ltx_text ltx_font_bold">Closure</span>: <mjx-container class="MathJax CtxtMenu_Attached_0" ctxtmenu_counter="11" jax="CHTML" role="presentation" style="font-size: 101.1%; position: relative;" tabindex="0"><mjx-math aria-hidden="true" class="ltx_Math MJX-TEX" id="Ch1.S1.I1.i1.p1.m1"><mjx-semantics><mjx-mrow><mjx-mrow><mjx-mrow><mjx-mo class="mjx-n"><mjx-c class="mjx-c2200"></mjx-c></mjx-mo><mjx-mi class="mjx-i"><mjx-c class="mjx-c1D465 TEX-I"></mjx-c></mjx-mi></mjx-mrow><mjx-mo class="mjx-n"><mjx-c class="mjx-c2C"></mjx-c></mjx-mo><mjx-mi class="mjx-i" space="2"><mjx-c class="mjx-c1D466 TEX-I"></mjx-c></mjx-mi></mjx-mrow><mjx-mo class="mjx-n" space="4"><mjx-c class="mjx-c2208"></mjx-c></mjx-mo><mjx-mi class="mjx-i" space="4"><mjx-c class="mjx-c1D43A TEX-I"></mjx-c></mjx-mi></mjx-mrow></mjx-semantics></mjx-math><mjx-assistive-mml display="inline" role="presentation" unselectable="on"><math alttext="\forall x,y\in G" class="ltx_Math" display="inline" xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mrow><mrow><mrow><mo>∀</mo><mi>x</mi></mrow><mo>,</mo><mi>y</mi></mrow><mo>∈</mo><mi>G</mi></mrow><annotation encoding="application/x-tex">\forall x,y\in G</annotation></semantics></math></mjx-assistive-mml></mjx-container>: <mjx-container class="MathJax CtxtMenu_Attached_0" ctxtmenu_counter="12" jax="CHTML" role="presentation" style="font-size: 101.1%; position: relative;" tabindex="0"><mjx-math aria-hidden="true" class="ltx_Math MJX-TEX" id="Ch1.S1.I1.i1.p1.m2"><mjx-semantics><mjx-mrow><mjx-mrow><mjx-mi class="mjx-i"><mjx-c class="mjx-c1D465 TEX-I"></mjx-c></mjx-mi><mjx-mo class="mjx-n" space="3"><mjx-c class="mjx-c2218"></mjx-c></mjx-mo><mjx-mi class="mjx-i" space="3"><mjx-c class="mjx-c1D466 TEX-I"></mjx-c></mjx-mi></mjx-mrow><mjx-mo class="mjx-n" space="4"><mjx-c class="mjx-c2208"></mjx-c></mjx-mo><mjx-mi class="mjx-i" space="4"><mjx-c class="mjx-c1D43A TEX-I"></mjx-c></mjx-mi></mjx-mrow></mjx-semantics></mjx-math><mjx-assistive-mml display="inline" role="presentation" unselectable="on"><math alttext="x\circ y\in G" class="ltx_Math" display="inline" xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mrow><mrow><mi>x</mi><mo>∘</mo><mi>y</mi></mrow><mo>∈</mo><mi>G</mi></mrow><annotation encoding="application/x-tex">x\circ y\in G</annotation></semantics></math></mjx-assistive-mml></mjx-container>.</p>
</div>
</li>
<li class="ltx_item" id="Ch1.S1.I1.i2">
<div class="ltx_para" id="Ch1.S1.I1.i2.p1">
<p class="ltx_p"><span class="ltx_text ltx_font_bold">Associativity</span>: <mjx-container class="MathJax CtxtMenu_Attached_0" ctxtmenu_counter="13" jax="CHTML" role="presentation" style="font-size: 101.1%; position: relative;" tabindex="0"><mjx-math aria-hidden="true" class="ltx_Math MJX-TEX" id="Ch1.S1.I1.i2.p1.m1"><mjx-semantics><mjx-mrow><mjx-mrow><mjx-mrow><mjx-mo class="mjx-n"><mjx-c class="mjx-c2200"></mjx-c></mjx-mo><mjx-mi class="mjx-i"><mjx-c class="mjx-c1D465 TEX-I"></mjx-c></mjx-mi></mjx-mrow><mjx-mo class="mjx-n"><mjx-c class="mjx-c2C"></mjx-c></mjx-mo><mjx-mi class="mjx-i" space="2"><mjx-c class="mjx-c1D466 TEX-I"></mjx-c></mjx-mi><mjx-mo class="mjx-n"><mjx-c class="mjx-c2C"></mjx-c></mjx-mo><mjx-mi class="mjx-i" space="2"><mjx-c class="mjx-c1D467 TEX-I"></mjx-c></mjx-mi></mjx-mrow><mjx-mo class="mjx-n" space="4"><mjx-c class="mjx-c2208"></mjx-c></mjx-mo><mjx-mi class="mjx-i" space="4"><mjx-c class="mjx-c1D43A TEX-I"></mjx-c></mjx-mi></mjx-mrow></mjx-semantics></mjx-math><mjx-assistive-mml display="inline" role="presentation" unselectable="on"><math alttext="\forall x,y,z\in G" class="ltx_Math" display="inline" xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mrow><mrow><mrow><mo>∀</mo><mi>x</mi></mrow><mo>,</mo><mi>y</mi><mo>,</mo><mi>z</mi></mrow><mo>∈</mo><mi>G</mi></mrow><annotation encoding="application/x-tex">\forall x,z\in G</annotation></semantics></math></mjx-assistive-mml></mjx-container>: <mjx-container class="MathJax CtxtMenu_Attached_0" ctxtmenu_counter="14" jax="CHTML" role="presentation" style="font-size: 101.1%; position: relative;" tabindex="0"><mjx-math aria-hidden="true" class="ltx_Math MJX-TEX" id="Ch1.S1.I1.i2.p1.m2"><mjx-semantics><mjx-mrow><mjx-mrow><mjx-mrow><mjx-mo class="mjx-n"><mjx-c class="mjx-c28"></mjx-c></mjx-mo><mjx-mrow><mjx-mi class="mjx-i"><mjx-c class="mjx-c1D465 TEX-I"></mjx-c></mjx-mi><mjx-mo class="mjx-n" space="3"><mjx-c class="mjx-c2218"></mjx-c></mjx-mo><mjx-mi class="mjx-i" space="3"><mjx-c class="mjx-c1D466 TEX-I"></mjx-c></mjx-mi></mjx-mrow><mjx-mo class="mjx-n"><mjx-c class="mjx-c29"></mjx-c></mjx-mo></mjx-mrow><mjx-mo class="mjx-n" space="3"><mjx-c class="mjx-c2218"></mjx-c></mjx-mo><mjx-mi class="mjx-i" space="3"><mjx-c class="mjx-c1D467 TEX-I"></mjx-c></mjx-mi></mjx-mrow><mjx-mo class="mjx-n" space="4"><mjx-c class="mjx-c3D"></mjx-c></mjx-mo><mjx-mrow space="4"><mjx-mi class="mjx-i"><mjx-c class="mjx-c1D465 TEX-I"></mjx-c></mjx-mi><mjx-mo class="mjx-n" space="3"><mjx-c class="mjx-c2218"></mjx-c></mjx-mo><mjx-mrow space="3"><mjx-mo class="mjx-n"><mjx-c class="mjx-c28"></mjx-c></mjx-mo><mjx-mrow><mjx-mi class="mjx-i"><mjx-c class="mjx-c1D466 TEX-I"></mjx-c></mjx-mi><mjx-mo class="mjx-n" space="3"><mjx-c class="mjx-c2218"></mjx-c></mjx-mo><mjx-mi class="mjx-i" space="3"><mjx-c class="mjx-c1D467 TEX-I"></mjx-c></mjx-mi></mjx-mrow><mjx-mo class="mjx-n"><mjx-c class="mjx-c29"></mjx-c></mjx-mo></mjx-mrow></mjx-mrow></mjx-mrow></mjx-semantics></mjx-math><mjx-assistive-mml display="inline" role="presentation" unselectable="on"><math alttext="(x\circ y)\circ z=x\circ(y\circ z)" class="ltx_Math" display="inline" xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mrow><mrow><mrow><mo stretchy="false">(</mo><mrow><mi>x</mi><mo>∘</mo><mi>y</mi></mrow><mo stretchy="false">)</mo></mrow><mo>∘</mo><mi>z</mi></mrow><mo>=</mo><mrow><mi>x</mi><mo>∘</mo><mrow><mo stretchy="false">(</mo><mrow><mi>y</mi><mo>∘</mo><mi>z</mi></mrow><mo stretchy="false">)</mo></mrow></mrow></mrow><annotation encoding="application/x-tex">(x\circ y)\circ z=x\circ(y\circ z)</annotation></semantics></math></mjx-assistive-mml></mjx-container>.</p>
</div>
</li>
<li class="ltx_item" id="Ch1.S1.I1.i3">
<div class="ltx_para" id="Ch1.S1.I1.i3.p1">
<p class="ltx_p"><span class="ltx_text ltx_font_bold">Existence of identity element</span>: <mjx-container class="MathJax CtxtMenu_Attached_0" ctxtmenu_counter="15" jax="CHTML" role="presentation" style="font-size: 101.3%; position: relative;" tabindex="0"><mjx-math aria-hidden="true" class="ltx_Math MJX-TEX" id="Ch1.S1.I1.i3.p1.m1"><mjx-semantics><mjx-mrow><mjx-mrow><mjx-mo class="mjx-n"><mjx-c class="mjx-c2203"></mjx-c></mjx-mo><mjx-mi class="mjx-i"><mjx-c class="mjx-c1D452 TEX-I"></mjx-c></mjx-mi></mjx-mrow><mjx-mo class="mjx-n" space="4"><mjx-c class="mjx-c2208"></mjx-c></mjx-mo><mjx-mi class="mjx-i" space="4"><mjx-c class="mjx-c1D43A TEX-I"></mjx-c></mjx-mi></mjx-mrow></mjx-semantics></mjx-math><mjx-assistive-mml display="inline" role="presentation" unselectable="on"><math alttext="\exists e\in G" class="ltx_Math" display="inline" xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mrow><mrow><mo>∃</mo><mi>e</mi></mrow><mo>∈</mo><mi>G</mi></mrow><annotation encoding="application/x-tex">\exists e\in G</annotation></semantics></math></mjx-assistive-mml></mjx-container>,<mjx-container class="MathJax CtxtMenu_Attached_0" ctxtmenu_counter="16" jax="CHTML" role="presentation" style="font-size: 101.3%; position: relative;" tabindex="0"><mjx-math aria-hidden="true" class="ltx_Math MJX-TEX" id="Ch1.S1.I1.i3.p1.m2"><mjx-semantics><mjx-mrow><mjx-mrow><mjx-mo class="mjx-n"><mjx-c class="mjx-c2200"></mjx-c></mjx-mo><mjx-mi class="mjx-i"><mjx-c class="mjx-c1D465 TEX-I"></mjx-c></mjx-mi></mjx-mrow><mjx-mo class="mjx-n" space="4"><mjx-c class="mjx-c2208"></mjx-c></mjx-mo><mjx-mi class="mjx-i" space="4"><mjx-c class="mjx-c1D43A TEX-I"></mjx-c></mjx-mi></mjx-mrow></mjx-semantics></mjx-math><mjx-assistive-mml display="inline" role="presentation" unselectable="on"><math alttext="\forall x\in G" class="ltx_Math" display="inline" xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mrow><mrow><mo>∀</mo><mi>x</mi></mrow><mo>∈</mo><mi>G</mi></mrow><annotation encoding="application/x-tex">\forall x\in G</annotation></semantics></math></mjx-assistive-mml></mjx-container>: <mjx-container class="MathJax CtxtMenu_Attached_0" ctxtmenu_counter="17" jax="CHTML" role="presentation" style="font-size: 101.3%; position: relative;" tabindex="0"><mjx-math aria-hidden="true" class="ltx_Math MJX-TEX" id="Ch1.S1.I1.i3.p1.m3"><mjx-semantics><mjx-mrow><mjx-mrow><mjx-mi class="mjx-i"><mjx-c class="mjx-c1D452 TEX-I"></mjx-c></mjx-mi><mjx-mo class="mjx-n" space="3"><mjx-c class="mjx-c2218"></mjx-c></mjx-mo><mjx-mi class="mjx-i" space="3"><mjx-c class="mjx-c1D465 TEX-I"></mjx-c></mjx-mi></mjx-mrow><mjx-mo class="mjx-n" space="4"><mjx-c class="mjx-c3D"></mjx-c></mjx-mo><mjx-mi class="mjx-i" space="4"><mjx-c class="mjx-c1D465 TEX-I"></mjx-c></mjx-mi><mjx-mo class="mjx-n" space="4"><mjx-c class="mjx-c3D"></mjx-c></mjx-mo><mjx-mrow space="4"><mjx-mi class="mjx-i"><mjx-c class="mjx-c1D465 TEX-I"></mjx-c></mjx-mi><mjx-mo class="mjx-n" space="3"><mjx-c class="mjx-c2218"></mjx-c></mjx-mo><mjx-mi class="mjx-i" space="3"><mjx-c class="mjx-c1D452 TEX-I"></mjx-c></mjx-mi></mjx-mrow></mjx-mrow></mjx-semantics></mjx-math><mjx-assistive-mml display="inline" role="presentation" unselectable="on"><math alttext="e\circ x=x=x\circ e" class="ltx_Math" display="inline" xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mrow><mrow><mi>e</mi><mo>∘</mo><mi>x</mi></mrow><mo>=</mo><mi>x</mi><mo>=</mo><mrow><mi>x</mi><mo>∘</mo><mi>e</mi></mrow></mrow><annotation encoding="application/x-tex">e\circ x=x=x\circ e</annotation></semantics></math></mjx-assistive-mml></mjx-container>.</p>
</div>
</li>
<li class="ltx_item" id="Ch1.S1.I1.i4">
<div class="ltx_para" id="Ch1.S1.I1.i4.p1">
<p class="ltx_p"><span class="ltx_text ltx_font_bold">Existence of inverses</span>: <mjx-container class="MathJax CtxtMenu_Attached_0" ctxtmenu_counter="18" jax="CHTML" role="presentation" style="font-size: 101.1%; position: relative;" tabindex="0"><mjx-math aria-hidden="true" class="ltx_Math MJX-TEX" id="Ch1.S1.I1.i4.p1.m1"><mjx-semantics><mjx-mrow><mjx-mrow><mjx-mo class="mjx-n"><mjx-c class="mjx-c2200"></mjx-c></mjx-mo><mjx-mi class="mjx-i"><mjx-c class="mjx-c1D465 TEX-I"></mjx-c></mjx-mi></mjx-mrow><mjx-mo class="mjx-n" space="4"><mjx-c class="mjx-c2208"></mjx-c></mjx-mo><mjx-mi class="mjx-i" space="4"><mjx-c class="mjx-c1D43A TEX-I"></mjx-c></mjx-mi></mjx-mrow></mjx-semantics></mjx-math><mjx-assistive-mml display="inline" role="presentation" unselectable="on"><math alttext="\forall x\in G" class="ltx_Math" display="inline" xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mrow><mrow><mo>∀</mo><mi>x</mi></mrow><mo>∈</mo><mi>G</mi></mrow><annotation encoding="application/x-tex">\forall x\in G</annotation></semantics></math></mjx-assistive-mml></mjx-container>,<mjx-container class="MathJax CtxtMenu_Attached_0" ctxtmenu_counter="19" jax="CHTML" role="presentation" style="font-size: 101.1%; position: relative;" tabindex="0"><mjx-math aria-hidden="true" class="ltx_Math MJX-TEX" id="Ch1.S1.I1.i4.p1.m2"><mjx-semantics><mjx-mrow><mjx-mrow><mjx-mo class="mjx-n"><mjx-c class="mjx-c2203"></mjx-c></mjx-mo><mjx-mi class="mjx-i"><mjx-c class="mjx-c1D466 TEX-I"></mjx-c></mjx-mi></mjx-mrow><mjx-mo class="mjx-n" space="4"><mjx-c class="mjx-c2208"></mjx-c></mjx-mo><mjx-mi class="mjx-i" space="4"><mjx-c class="mjx-c1D43A TEX-I"></mjx-c></mjx-mi></mjx-mrow></mjx-semantics></mjx-math><mjx-assistive-mml display="inline" role="presentation" unselectable="on"><math alttext="\exists y\in G" class="ltx_Math" display="inline" xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mrow><mrow><mo>∃</mo><mi>y</mi></mrow><mo>∈</mo><mi>G</mi></mrow><annotation encoding="application/x-tex">\exists y\in G</annotation></semantics></math></mjx-assistive-mml></mjx-container>: <mjx-container class="MathJax CtxtMenu_Attached_0" ctxtmenu_counter="20" jax="CHTML" role="presentation" style="font-size: 101.1%; position: relative;" tabindex="0"><mjx-math aria-hidden="true" class="ltx_Math MJX-TEX" id="Ch1.S1.I1.i4.p1.m3"><mjx-semantics><mjx-mrow><mjx-mrow><mjx-mi class="mjx-i"><mjx-c class="mjx-c1D465 TEX-I"></mjx-c></mjx-mi><mjx-mo class="mjx-n" space="3"><mjx-c class="mjx-c2218"></mjx-c></mjx-mo><mjx-mi class="mjx-i" space="3"><mjx-c class="mjx-c1D466 TEX-I"></mjx-c></mjx-mi></mjx-mrow><mjx-mo class="mjx-n" space="4"><mjx-c class="mjx-c3D"></mjx-c></mjx-mo><mjx-mi class="mjx-i" space="4"><mjx-c class="mjx-c1D452 TEX-I"></mjx-c></mjx-mi><mjx-mo class="mjx-n" space="4"><mjx-c class="mjx-c3D"></mjx-c></mjx-mo><mjx-mrow space="4"><mjx-mi class="mjx-i"><mjx-c class="mjx-c1D466 TEX-I"></mjx-c></mjx-mi><mjx-mo class="mjx-n" space="3"><mjx-c class="mjx-c2218"></mjx-c></mjx-mo><mjx-mi class="mjx-i" space="3"><mjx-c class="mjx-c1D465 TEX-I"></mjx-c></mjx-mi></mjx-mrow></mjx-mrow></mjx-semantics></mjx-math><mjx-assistive-mml display="inline" role="presentation" unselectable="on"><math alttext="x\circ y=e=y\circ x" class="ltx_Math" display="inline" xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mrow><mrow><mi>x</mi><mo>∘</mo><mi>y</mi></mrow><mo>=</mo><mi>e</mi><mo>=</mo><mrow><mi>y</mi><mo>∘</mo><mi>x</mi></mrow></mrow><annotation encoding="application/x-tex">x\circ y=e=y\circ x</annotation></semantics></math></mjx-assistive-mml></mjx-container>.
</p>
</div>
</li>
</ol>
</div>
</div>
'''
然后运行 .get_text() 我们得到:
定义 1.1.1(组:第一个定义)。
一个群 (G,∘)(G,\circ) 是一个非空集 GG 和一个二进制 操作∘\circ——称为“群律”——满足:
闭包:∀x,y∈G\forall x,y\in G:x∘y∈Gx\circ y\in G。
结合性:∀x,z∈G\forall x,z\in G: (x∘y)∘z=x∘(y∘z)(x\circ y)\circ z=x\circ(y\circ z).
恒等元存在:∃e∈G\exists e\in G,∀x∈G\forall x\in G: e∘x=x=x∘ee\circ x=x=x\circ e.
逆的存在性:∀x∈G\forall x\in G,∃y∈G\exists y\in G: x∘y=e=y∘xx\circ y=e=y\circ x。
想要的输出:
定义 1.1.1(组:第一个定义)。
一个群 (G,\circ) 是一个非空集 G 和一个二进制 操作\circ——称为“群律”——满足:
闭合:\forall x,y\in G:x\circ y\in G。
结合性:\forall x,z\in G: (x\circ y)\circ z=x\circ(y\circ z).
存在单位元素:\exists e\in G,\forall x\in G: ee\circ x=x=x\circ e.
存在逆:\forall x\in G,\exists y\in G: x\circ y=e=y\circ x.
最初,我将文本保存到文件中,然后手动编辑它,我正在寻找更优雅的解决方案,因为这只是我第二次使用 bs4 模块,但不知道该看什么因为在文档中。
解决方法
我的方法(没有任何外部库)将沿着这些路线:
import bs4
import re
html = '''
#Insert the long HTML provided above
'''
els = soup.find_all('p')
for el in els:
string = re.sub('[ \n]+',' ',el.text).strip()
print(string)
基本上,您正在寻找所有段落元素(其中存储了相应的文本元素),您遍历它们并单独剥离每个元素(取决于您想要删除的格式 - 使用正则表达式)。 最后,打印字符串。
版权声明:本文内容由互联网用户自发贡献,该文观点与技术仅代表作者本人。本站仅提供信息存储空间服务,不拥有所有权,不承担相关法律责任。如发现本站有涉嫌侵权/违法违规的内容, 请发送邮件至 dio@foxmail.com 举报,一经查实,本站将立刻删除。