Exclude URLs from Graph View based on substring

What I’m trying to do

I am using a web clipper that imports webpages. Many of these are Wikipedia pages. Wikipedia pages almost always include a bunch of URLs with the format /wiki/pagename . I’d like to exclude just these URLs—i.e., ones that that contain “wiki” in their string—from my Graph View, but keep the rest.

Things I have tried

Over an hour (a) searching this forum, reddit, and Google for an answer; and (b) experimenting with regular expressions and such but with no success. Using -file:wiki and -path:wiki get rid of too much, such as existing files that have “wiki” in their names. Adding the slash—e.g., using -path:/wiki—ends up excluding nothing. Escaping the slash with backslash doesn’t make any difference.

that’s what you need; the regex is: /\/wiki\/.+/
never used graph so don’t know if how urls show up or not, sorry
so prolly you’d put - in front of the regex given…?

sorry, didn’t try this but the regex will match

1 Like

Hello! Thanks for replying. I’ve tried every legal variation of your regex but it does not work. Can you try it out?

then /.*?\/wiki\/.+/ with dash in front right should do it

otherwise why don’t you upload in a zip a folder of some of these files so ppl can see more to be able to help… just an idea…

This topic was automatically closed 90 days after the last reply. New replies are no longer allowed.