代码之家 › 专栏 › 技术社区 › con

Negative Lookaward在perl正则表达式中不起作用

regex-look-ahead perl regex

con · 技术社区 · 8 月前

我正在解析一个NWChem输出文件,其文本如下:

    General Information
    -------------------
  SCF calculation type: DFT
  Wavefunction type:  closed shell.
  No. of atoms     :    10
  No. of electrons :    36
   Alpha electrons :    18
    Beta electrons :    18
  Charge           :     0
  Spin multiplicity:     1
  Use of symmetry is: on ; symmetry adaption is: on 
  Maximum number of iterations:  30
  AO basis - number of functions:    95
             number of shells:    45
  Convergence on energy requested:  1.00D-06
  Convergence on density requested:  1.00D-05
  Convergence on gradient requested:  5.00D-04

      XC Information
      --------------

我已将文件保存为字符串 $str ,并将每个换行符替换为 Ñ . 上述文本在文件中出现了大约10次,所以我想用这样的东西来捕获它们 General Information :

my @capture = $str =~ m/General\s+InformationÑ
\s+[-]+Ñ
(.+(?!\-{2,})) # negative lookahead, no more than 2 "-" characters
ÑÑ\s+[-]+
/xg;

上面的正则表达式只抓取了整个文件,这是不正确的。

我也试过了 (.+(?![\-]{2,})) 哪一个也捕获的文本比它应该捕获的要多得多。

如何更改正则表达式 (.+(?!\-{2,})) 因此不超过2 - 里面允许有字符吗?

2 回复 | 直到 8 月前

ikegami Gilles Quénot 8 月前

只捕捉 General Information 部分,

my $gi = /
   ^
   \s* General[ ]Information \n  # A line with the header
   \s* -{2,} \n                  # Followed by a separator line.
   (?: .* \n (?! \s* -- ) )*     # Lines not followed by a separator.
/xm ? $& : undef;

为了分别捕获每个部分,

my @sections = /
   ^
   \s* \S[\S\h]* \n              # A line with the header.
   \s* -{2,} \n                  # Followed by a separator line.
   (?: .* \n (?! \s* -- ) )*     # Lines not followed by a separator.
/xmg;

aaa 8 月前

虽然这是可能的,但你可能可以在不使用负面前瞻的情况下使用以下内容来捕捉它:

\s*General\s+Information\s+(?:---)+-*[\s\S]+?(?:---)+-*\s*$

You can read the details here

推荐文章

Manny · 如何比较Perl中的字符串?

3 年前

BioRod · 我不能用Perl打印键和值

3 年前

user17227456 · Perl CLI代码无法追加字符串行

3 年前

LearnToBeBetter · 读取文件,搜索字符串,打印字符串

3 年前

KJ7LNW · 一些波斯语文本的宽字符印刷,但其他文本则没有

3 年前

Amit M · 如何用FFI:Platypus替换cpan Perl实用程序P5NCI

3 年前

con · 如何搜索大型数据结构并返回一系列给出特定值的键/数组?

3 年前

rohithguptha potti · 在LINUX操作系统上执行一些Perl命令时,这些模块可以在LINUX中使用,也可以不在LINUX中使用

3 年前

Tonys AnsonÄ« Misirgis · 当“网站”选项卡关闭时,服务器如何知道关闭websocket的连接

7 年前

Pranay Nanda · 使用regex解析许可证文件

7 年前