Filter won't accept regexp

Tips on writing regular expressions for searching the post list

Moderators: Quade, dexter

Filter won't accept regexp

Postby Takamatsu » Sun Sep 07, 2003 12:17 pm

Okay, I have tested this string in the filter bar, and it works, but when I try to add this to the File Filters, it refuses to add.

JUBEI.CHAN(.|..|...)(0[1-9]|1[0-3])

I tried this and this adds successfully
JUBEI.CHAN.(0[1-9]|1[0-3])


Any ideas?

Basically I'm trying to compensate for misc characters between the title "Jubei Chan" and the episode number (1-13)

For example... I want to filter out "Jubei-chan - Ep 13" but not
"Jubei-chan 14"
User avatar
Takamatsu
Active Participant
Active Participant
 
Posts: 69
Joined: Sat Aug 16, 2003 12:50 pm

Registered Newsbin User since: 04/08/03

Postby Quade » Sun Sep 07, 2003 12:46 pm

Use ".*" between the sections.

"JUBEI.CHAN.*(0[1-9]|1[0-3])" Should work (I didn't test though).

Typically in this case I use a simple "Find" like "JUBEI". then mark the ones I don't want and hit the delete key. Or I column sort to get them in episode order.
User avatar
Quade
Eternal n00b
Eternal n00b
 
Posts: 44984
Joined: Sat May 19, 2001 12:41 am
Location: Virginia, US

Registered Newsbin User since: 10/24/97

Postby Takamatsu » Sun Sep 07, 2003 9:24 pm

Okay... the problem with this.

I am putting this in the "Reject File Filters"

JUBEI.CHAN.*(0[1-9]|1[0-3])

The idea is to eliminate episodes 1-13... so episode #14 gets posted.

Here's a sample subject line.

Jubei-Chan Full Repost [15/26] - "Jubei-Chan - Ep 14 [P-A].part12.rar" yEnc (01/51)

This like would be rejected because of the 12, even if I really wanted this line.
I'm only concerning about this, if I leave the robot running in auto mode, when I have to leave the computer for a extended period of time. (My ISP only offers a one day retention, so I sorta start panicing when I have to leave for a weekend)
User avatar
Takamatsu
Active Participant
Active Participant
 
Posts: 69
Joined: Sat Aug 16, 2003 12:50 pm

Registered Newsbin User since: 04/08/03

Postby Quade » Sun Sep 07, 2003 10:43 pm

Why not just put "Jubei-Chan - Ep 14" in the reject filter?
User avatar
Quade
Eternal n00b
Eternal n00b
 
Posts: 44984
Joined: Sat May 19, 2001 12:41 am
Location: Virginia, US

Registered Newsbin User since: 10/24/97

Postby Smite » Mon Sep 08, 2003 3:02 am

Takamatsu wrote:Okay... the problem with this.

I am putting this in the "Reject File Filters"

JUBEI.CHAN.*(0[1-9]|1[0-3])

The idea is to eliminate episodes 1-13... so episode #14 gets posted.

Here's a sample subject line.

Jubei-Chan Full Repost [15/26] - "Jubei-Chan - Ep 14 [P-A].part12.rar" yEnc (01/51)


Does:
JUBEI.CHAN..?.?(0[1-9]|1[0-3]) fit the bill?

Usually there's a pretty consistant naming convention for the eps, and you should be able to find a way to tell which is the ep number, such as:
"Jubei-Chan - Ep (0[1-9]|1[0-3])" or "Jubei-Chan.*(0[1-9]|1[0-3]) \[P-A"
User avatar
Smite
Katamari Damacy Addict
 
Posts: 5318
Joined: Sat May 19, 2001 1:54 am
Location: Alberta, Canada

Registered Newsbin User since: 03/27/03

Postby Takamatsu » Mon Sep 08, 2003 10:06 am

Quade wrote:Why not just put "Jubei-Chan - Ep 14" in the reject filter?


Because I WANT episode 14, not reject it.

Often file names from posting of one episode do not match postings of another episode.

Besides... my true problem isn't the regexp itself...

It's that the program refuses to add it to the reject list.

The question is...

Does the program sort of compare the regexp rule against preexisting ones? Because after I have added about 10 rules, it stops allowing me to add some.
User avatar
Takamatsu
Active Participant
Active Participant
 
Posts: 69
Joined: Sat Aug 16, 2003 12:50 pm

Registered Newsbin User since: 04/08/03

Postby Quade » Mon Sep 08, 2003 12:02 pm

If they're invalid regex then you can't add them. I know I've had more than 10 in there. I'll have to experiment with it.
User avatar
Quade
Eternal n00b
Eternal n00b
 
Posts: 44984
Joined: Sat May 19, 2001 12:41 am
Location: Virginia, US

Registered Newsbin User since: 10/24/97

Postby Takamatsu » Mon Sep 08, 2003 9:12 pm

Okay... to make it simple, I'll post my regexp list, straight from my Filters.xml

DRAGON.DRIVE[ -]+([0-2][0-9]|3[0-8])
EVANGELION.(0[1-9]|1[0-9]|2[0-6])
FUSHIGI.YUUGI[ -_]+EP.([0-4][1-9]|5[0-2])
GUNDAM.*SEED[.]*([0-3][0-9]|4[0-6])
JUBEI.CHAN[ -]+EP.(0[1-9]|1[0-3])
KODOCHA[ -]+(00[1-9]|01[0-9]|02[0-3]|02[5-9]|03[0-5]|05[2-8]|06[0-4]|08[2-3])
NGE.(0[1-9]|[1][0-9]|2[0-6])
RUROUNI.KENSHIN
SAMURAI.GIRL[ -]+(0[1-9]|1[0-3])
ULTRA.MANIAC.([0][0-9]|1[0-3])

Now I'm trying to add this reg exp

INUYASHA[ -]+([0-9][0-9] |1[0-1][0-9]|1[2][0-2])

and it refuses to add it.

I remove one of the above, and it accepts it.
User avatar
Takamatsu
Active Participant
Active Participant
 
Posts: 69
Joined: Sat Aug 16, 2003 12:50 pm

Registered Newsbin User since: 04/08/03

Postby Takamatsu » Fri Sep 12, 2003 10:56 pm

Any news about this problem? Should I post it as a problem in Tech Support?
User avatar
Takamatsu
Active Participant
Active Participant
 
Posts: 69
Joined: Sat Aug 16, 2003 12:50 pm

Registered Newsbin User since: 04/08/03

Postby Smite » Sat Sep 13, 2003 12:34 am

No, I think there's a known problem somewhere in there. A combination of complexity of regex, and number of filters.
User avatar
Smite
Katamari Damacy Addict
 
Posts: 5318
Joined: Sat May 19, 2001 1:54 am
Location: Alberta, Canada

Registered Newsbin User since: 03/27/03

Postby Quade » Sat Sep 13, 2003 1:36 am

State machine gets too bug for the allocated storage.
User avatar
Quade
Eternal n00b
Eternal n00b
 
Posts: 44984
Joined: Sat May 19, 2001 12:41 am
Location: Virginia, US

Registered Newsbin User since: 10/24/97

Postby Takamatsu » Sat Sep 13, 2003 10:24 am

So best way to resolve it is to not to get too complicated with the regexp? I'll try and see what works.

Thanks.
User avatar
Takamatsu
Active Participant
Active Participant
 
Posts: 69
Joined: Sat Aug 16, 2003 12:50 pm

Registered Newsbin User since: 04/08/03

Postby Quade » Sat Sep 13, 2003 11:08 am

I'm looking at it too, to see if I can increase the space and not break anything.
User avatar
Quade
Eternal n00b
Eternal n00b
 
Posts: 44984
Joined: Sat May 19, 2001 12:41 am
Location: Virginia, US

Registered Newsbin User since: 10/24/97

Postby Stork » Sat Oct 15, 2005 9:48 pm

I am putting this in the "Reject File Filters"

JUBEI.CHAN.*(0[1-9]|1[0-3])

The idea is to eliminate episodes 1-13... so episode #14 gets posted.

Here's a sample subject line.

Jubei-Chan Full Repost [15/26] - "Jubei-Chan - Ep 14 [P-A].part12.rar" yEnc (01/51)

wouldn't JUBEI.CHAN.*(0[1-9]|1[0-3]) possibly get cought up in the [15/26] after repost or (01/51) near the end of the subject?

better would be if it was "JUBEI.CHAN.EP (0[1-9]|1[0-3])" minus quotes of course.
Stork
n00b
n00b
 
Posts: 1
Joined: Mon Aug 04, 2003 10:49 am

Registered Newsbin User since: 04/13/03

Postby Smite » Sat Oct 15, 2005 9:51 pm

Not if it's a "File Filter". The filenames does not include any of those extra numbers.
Please read the FAQ before asking any questions.
If you're new to newsgroups, and the files on them, you can find a very helpful guide here.
User avatar
Smite
Katamari Damacy Addict
 
Posts: 5318
Joined: Sat May 19, 2001 1:54 am
Location: Alberta, Canada

Registered Newsbin User since: 03/27/03


Return to Regular Expressions

Who is online

Users browsing this forum: No registered users and 0 guests