Grep characters before and after match?

小开

You can use

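awk 'match($0, /test_pattern/) {
    # Clamp the start so a match near the beginning of the line keeps its 10 trailing characters
    start = (RSTART > 10) ? RSTART - 10 : 1
    print substr($0, start, RSTART + RLENGTH + 10 - start)
}' file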

小开

Best answer

3 characters before and 4 characters after

$> echo "some123_string_and_another" | grep -o -P '.{0,3}string.{0,4}'
23_string_and

小开

Do you mean something like this:

grep -o '.\{0,20\}test_pattern.\{0,20\}' file

?

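This prints up to 20 characters on either side of test_pattern. The \{0,20\} notation is like *, but specifies zero to twenty repetitions instead of zero or more. The -o option shows only the match itself rather than the entire line.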
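As a quick check of that behaviour, here is a minimal sketch using a made-up sample line (the `line` variable and its contents are purely illustrative). It suggests that the window simply shrinks when fewer characters are available on one side of the pattern.

```shell
# Hypothetical sample line; only 2 characters precede the pattern.
line='abtest_patternXYZ123'

# Ask for up to 3 characters on each side. The left side has only 2 to give,
# so the match starts at the beginning of the line.
window=$(printf '%s\n' "$line" | grep -o '.\{0,3\}test_pattern.\{0,3\}')
printf '%s\n' "$window"   # abtest_patternXYZ
```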

小开

grep -E -o ".{0,5}test_pattern.{0,5}" test.txt

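This matches up to 5 characters before and after your pattern. The -o switch tells grep to show only the match, and -E tells it to use extended regular expressions. Make sure to put quotes around the expression, otherwise the shell may interpret it.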
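One thing worth knowing when a line contains several matches: with -o each match is printed on its own line, and matches do not overlap, so a later match may get less leading context than requested. A small sketch, assuming a made-up input string and a 2-character window:

```shell
# Hypothetical input with two occurrences of X, three characters apart.
line='aaaXbbbXccc'

# Up to 2 characters of context on each side of X.
# The first match consumes "bb", so the second match only has "b" left
# as leading context.
matches=$(printf '%s\n' "$line" | grep -E -o ".{0,2}X.{0,2}")
printf '%s\n' "$matches"
# aaXbb
# bXcc
```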

小开

With gawk, you can use the match function:

x="hey there how are you"
echo "$x" |awk --re-interval '{match($0,/(.{4})how(.{4})/,a);print a[1],a[2]}'
ere   are

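If you are OK with perl, here is a more flexible solution. The following prints 3 characters before the pattern, then the pattern itself, then 5 characters after the pattern.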

echo hey there how are you |perl -lne 'print "$1$2$3" if /(.{3})(there)(.{5})/'
ey there how

This can also be applied to words instead of just characters. The following prints one word before the actual matching string.

echo hey there how are you |perl -lne 'print $1 if /(\w+) there/'
hey

The following prints one word after the pattern:

echo hey there how are you |perl -lne 'print $2 if /(\w+) there (\w+)/'
how

The following prints one word before the pattern, then the actual word, then one word after the pattern:

echo hey there how are you |perl -lne 'print "$1$2$3" if /(\w+)( there )(\w+)/'
hey there how

小开

You can use a regexp grep to find the match, plus a second grep to highlight it:

echo "some123_string_and_another" | grep -o -P '.{0,3}string.{0,4}' | grep string

23_string_and

小开

I'll never easily remember these cryptic command modifiers, so I took the top answer and turned it into a function in my ~/.bashrc file:

cgrep() {
    # For files whose lines run to tens of thousands of characters.
    # Use cgrep to print 30 characters before and after the search pattern.
    if [ $# -eq 2 ] ; then
        # Format was: cgrep "search string" /path/to/filename
        grep -o -P ".{0,30}$1.{0,30}" "$2"
    else
        # Format was: cat /path/to/filename | cgrep "search string"
        grep -o -P ".{0,30}$1.{0,30}"
    fi
} # cgrep()

Here's what it looks like in action:

$ ll /tmp/rick/scp.Mf7UdS/Mf7UdS.Source
-rw-r--r-- 1 rick rick 25780 Jul  3 19:05 /tmp/rick/scp.Mf7UdS/Mf7UdS.Source

$ cat /tmp/rick/scp.Mf7UdS/Mf7UdS.Source | cgrep "Link to iconic"
1:43:30.3540244000 /mnt/e/bin/Link to iconic S -rwxrwxrwx 777 rick 1000 ri

$ cgrep "Link to iconic" /tmp/rick/scp.Mf7UdS/Mf7UdS.Source
1:43:30.3540244000 /mnt/e/bin/Link to iconic S -rwxrwxrwx 777 rick 1000 ri

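The file in question is one continuous 25K line, and it is hopeless to find what you are looking for using regular grep.

Notice the two different ways you can call cgrep, which parallel the way grep itself is called.

There is a "niftier" way of creating the function where "$2" is only passed when set, which would save 4 lines of code. I don't have it handy though. Something like ${parm2} $parm2. If I find it I'll revise the function and this answer.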
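One idiom that fits this description is the `${2:+"$2"}` parameter expansion, which expands to the quoted second argument only when one was given. The sketch below is an assumption about what such a shorter version could look like, not necessarily the form the author had in mind; the name `cgrep2` is hypothetical, and it uses -E instead of -P so that it does not depend on PCRE support in grep.

```shell
# Hypothetical shorter variant of cgrep.
# ${2:+"$2"} expands to "$2" when a filename was passed, and to nothing
# otherwise, in which case grep reads from standard input.
cgrep2() {
    grep -o -E ".{0,30}$1.{0,30}" ${2:+"$2"}
}
```

Both call styles from the original function should still work: `cgrep2 "search string" /path/to/filename` and `cat /path/to/filename | cgrep2 "search string"`.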

小开

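With ugrep you can specify -ABC context together with option -o (--only-matching) to show the match with extra characters of context before and/or after it, fitting the match plus the context within the specified -ABC width. For example: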

ugrep -o -C30 pattern testfile.txt

gives:

     1: ... long line with an example pattern to match.  The line could...
     2: ...nother example line with a pattern.

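On a terminal with color highlighting, the matches in the same output are highlighted.

Multiple matches on a line are shown with [+nnn more], or you can use option -k (--column-number) to show each match individually with its context and column number.

The context width is the number of Unicode characters displayed (UTF-8/16/32), not just ASCII.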

小开

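I personally do something similar to the posted answers. But since the dot key, like any keyboard key, can be tapped or held down, and I usually don't need much context (if I needed more I might work in lines with grep -C, but often, like you, I don't want the lines before and after), I find it quicker to type the command by tapping the dot key once per character of context when it's only a few, or holding it down for more.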

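For example, echo zzzabczzzz | grep -o '.abc..'

shows the abc pattern with one character before it and two after. (In regex language, a dot matches any character.) Others used the dot too, but with curly braces to specify repetition.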

If I wanted to be strict about matching between 0 and x characters, or exactly y characters, then I'd use the curly braces, and -P, as others have done.
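The difference between literal dots and a bounded repetition shows up when the pattern sits at the edge of a line. A minimal sketch, assuming a made-up input where abc starts the line (it uses -E rather than -P, which is enough for this case):

```shell
# Hypothetical input where the pattern sits at the very start of the line.
line='abczzzz'

# Fixed dots demand exactly one character before abc, so nothing matches
# (grep exits non-zero, hence the || true).
fixed=$(printf '%s\n' "$line" | grep -o '.abc..' || true)

# A bounded repetition also accepts zero characters before abc.
bounded=$(printf '%s\n' "$line" | grep -o -E '.{0,1}abc.{0,2}')

printf 'fixed=[%s] bounded=[%s]\n' "$fixed" "$bounded"
# fixed=[] bounded=[abczz]
```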

There is a setting for whether the dot matches a newline, which you can look into if that is a concern or of interest.

小开

If using ripgrep, this is how you would do it:

rg -o ".{0,5}test_pattern.{0,5}" test.txt