Corrected placement of _bz_build_schema_from_disk accidentally move it under Bugzilla::DB::Mssql::st during some cleanup. Incidentally, why are not making separate st.pm files?

Attachment #372701 - Attachment is obsolete: true

Attachment #372701 - Flags: review?(mkanat)

Michael Thomas (Mockodin)

Reporter

•

16 years ago

Attached patch Patch for Bugzilla::DB::Mssql (obsolete) — Details — Splinter Review

Cleaned up un-needed functions.

Attachment #372742 - Attachment is obsolete: true

Attachment #372952 - Flags: review?(mkanat)

Attachment #372742 - Flags: review?(mkanat)

Michael Thomas (Mockodin)

Reporter

Comment 7

•

16 years ago

Attached patch Patch for Bugzilla::DB::Mssql (obsolete) — Details — Splinter Review

Minor updates to remove some over thinking on my part. removed the custom TO_DAYS , FROM_DAYS functions in favor of directly calling DATEDIFF and DATEADD

Attachment #372952 - Attachment is obsolete: true

Attachment #372952 - Flags: review?(mkanat)

Michael Thomas (Mockodin)

Reporter

Updated

•

16 years ago

Attachment #374521 - Flags: review?(mkanat)

Frédéric Buclin

Comment 8

•

15 years ago

Comment on attachment 374521 [details] [diff] [review] Patch for Bugzilla::DB::Mssql >+use constant GROUPBY_REGEXP => '((CASE\s+WHEN.+END)|(SUBSTR.+\))|(TO_CHAR$.+$)|($SCORE.+$)|($MATCH.+$)|(\w+(\.\w+)?))(\s+AS\s+)?(.*)?$'; Note that GROUPBY_REGEXP no longer exists. It can go away from your file.

Michael Thomas (Mockodin)

Reporter

Comment 9

•

15 years ago

Attached patch Patch v5 (obsolete) — Details — Splinter Review

Corrected the creation logic on triggers to create single Update and delete trigger instead of one for one when using a trigger instead of a FKey. This required refactoring of the schema where references where concerned. In the cases where a FKey is still used added logic to determine if index can be filtered (ignore nulls) or if a normal index should be used A result of the mix triggers and fkeys is that schema changes will trigger them to be completely refreshed. By that I mean dropped and recreated. This includes when creating custom multi select fields. Running checksetup against mssql takes this into account and rebuilds the custom field table fkeys/triggers in addition to the default bugzilla tables fkeys/triggers. I need to run a few more tests against a loaded data set, about 200k bugs, 1.5M comments. And see if I need to change this a bit, I probably need to make this only drop and recreate if something changed for that specific FKey or associated tables. A fair number of other adjustments, overall performance under mssql improved greatly. the biggest limitation at this point I think is the speed of TT2 under windows. Pages like the admin edit user screen are slow while it parses the product responsibilities (default assignee or QA contact etc..) I'm guessing these are a bit sluggish, comparatively to other pages, under any OS and Web server however.

Attachment #374521 - Attachment is obsolete: true

Attachment #390548 - Flags: review?(mkanat)

Attachment #374521 - Flags: review?(mkanat)

Michael Thomas (Mockodin)

Reporter

Updated

•

15 years ago

Attachment #390548 - Flags: review?(mkanat)

Michael Thomas (Mockodin)

Reporter

Comment 10

•

15 years ago

Attached patch v6 (obsolete) — Details — Splinter Review

Does NOT included the source code for the CLR Object

Attachment #390548 - Attachment is obsolete: true

Michael Thomas (Mockodin)

•

15 years ago

re Bugzilla::DB::StatementModifier Oracle uses this Bugzilla::DB::Oracle::_fix_hashref($ref); in most of the functions, I dropped it's usage in MSSQL, should be fine to create Bugzilla::DB::StatementModifier by changing it to Bugzilla::DB::_fix_hashref($ref) and just return for mssql. Or alternatively perhaps put it in Util.pm as it could be used for non-sql operations. Make it a constant to to see if we need to use it or not. Thoughts?

Max Kanat-Alexander

Comment 18

•

15 years ago

(In reply to comment #15) > Noting I related alot of nits or items to being the result of copy and paste > from Mysql.pm or Oracle.pm I will comment those items as <CPItem> purely to > save typing a full response, generally take it mean that the item is removable > or simple acknowledgment of the comment with the pointer back to a Copy and > Paste. That's fine. > > Untrue. > > <CPItem> Does need to be fixed. Should also be fixed in Oracle.pm by Xiaoou, but that's a separate issue. > Actually that shouldn't be there, some internal code here uses it. Okay. Your internal code won't work right under mod_perl, FWIW. > > >+# In reality, you could have a LOT more comments than this, because > > >+# MAX_COMMENT_LENGTH is big. > > >+use constant MAX_COMMENTS => 50; > > > <CPItem> Yeah, should go. > > >+use constant EMPTY_STRING => ''; > > > > You don't need to specify that. > > So long as that empty string placer holder doesn't get used.. <CPItem> It's Oracle-specific, because oracle thinks an empty string IS NULL. > > Um, shouldn't that be specifyable somehow? Or at least in a comment. > > YES, left in that manner though since we don't seem to allow for this in other > packages, how might we want to do this? Optimally it would be a item in > localconfig I would assume, but unique to mssql felt odd when I thought about > it. And this really is built for the SQL Server Native Client 10.0 driver. > Which is freely downloadable from the Microsoft. Hmm. So there's no way to have it auto-specify that, then, or just let something figure it out on its own? Does it really have to have an explicit version specifier? At the worst, it should be in a constant at the top of the file. > MARS is Microsoft's version of Multiple Active Results Sets Okay, thanks. So I assume that's what lets us have multiple selects going at once on one connection? > App allows you see connection in mssql activity console and know what > application is connecting otherwise reports as apache, which is not useful when > you may have more than one apache app connection.. which I do. Ah, that's great, okay. You might want to use $0 instead, though? > PWD: I'll test it later. Copied from connections string site. Okay. :-) Yeah, $user and $pass should go into DBI arguments instead of the DSN. > > >+ my $attrs = { odbc_default_bind_type => '-9', LongReadLen => 5000000 }; > > > > What are those for? Also, why isn't LongReadLen a constant? I didn't see an answer there. (Was mostly curious what the odbc_default_bind_type is.) > Referencing a ..um.. disagreement you had with another contributor on yahoo > groups, least ways I mean that in that you shot down his response to someone > else.. yes it can work perfectly fine. It just takes effort to compile so > modules. Okay. Have those patches been sent to the maintainer of those modules? > That said similar to %lock this could be removed. Well, it doesn't hurt. But the code you have there (list of references, not a reference to a list) almost certainly doesn't actually work. > > Ummm, there's no way to get query analysis data out of MS-SQL via SQL? > > If I work hard enough, probably, the effort I did put in at one point showed > that what builtin function there were, were basically less useful, I mean that, > than the raw sql being displayed. So yeah, unless someone has time to find a > real way to accomplish this. Well, the raw SQL is always displayed. What are the built-in functions? This is not critical; it just helps with analyzing query.cgi slowness. > > Umm, what if we just inserted several rows into several different tables? > > It's not something that I think we do, but what if we did? > > > Then we would have a problem, but is that different than mysql's > LAST_INSERT_ID() Oh, yeah, I suppose you're right. OK. > > >+ return "dbo.REGEXP($expr,$pattern, 0) = 1"; > > > > What's the 0 for? > > One of my custom CLR functions, to enable (1) or disable (0) case sensitivity Oh, that's nice. Okay. > > Nit: Should have a better variable name. > > All come on :-) you know you like it. LOL! > You will run you server dry of memory and possibly swap space before you fill a > nvarchar(max) if I recall think terabytes. Granted that is a possible size not > a practical one. Ahhh, okay. I was thinking of other databases where the max varchar size is like 4000 characters. > > Then let's not have the code here. But I don't see why it would be in a GROUP > > BY. > > Don't recall the scenarios, but if I removed it its because I ran in to it with > no work around. Will review notes. Okay. I'd be happy to work with you on it; it's nice to have real fulltext. > > Why the spaces around the SQL? > > Readability in code, but no reason, probably paste it from a query window. Okay. For consistency with the other drivers, let's get rid of the spaces. > > >+sub sql_group_concat { > > >+ my ($self, $column, $separator, $sortorder) = @_; > > > > We don't support $sortorder yet, so I'd wait until we actually have that > > before adding it to this patch. > > With out altering the CLR, which I'd rather not it still need to be referenced > internally to the function, but the param could be dropped. Okay. But you might have to alter the CLR anyway, when we actually add this. sortorder will be a column name or an expression, not ASC or DESC or something like that. > > That should be an install_string. Also, it probably shouldn't happen every > > time somebody runs checksetup.pl. If it has to happen every time, then there > > shouldn't be a string printed. > > Yes can be moved there, was living there so that I didn't have to wipe my > database with each CLR edit. Hadn't really occurred to me, but makes sense. I > don;t suppose there is a commandline switch to trigger specific install > functions, at least for this it would be useful at times. Eg migrated data to a > new database and you want to insure everything is enabled and registered etc.. > That said it only takes about 2 seconds to deploy as is...? Perhaps you should put a version number into the CLR or include some sort of date about it in the database, so we could check if we have to update it. > > Instead of having an enormous string inside of this file, which will be > > loaded into memory for every httpd child, could the CLR code live in an > > external file? > > ..Yes.. we had some debate here on that actually. Feel free to side with me on > that one :-), since it appears you are :-P Yes, I am siding with you. :-) > Hehe get a bigger monitor :-) Wrapping on 80 when possible yes? Think I heard > about that after most of this was complete. Yes, wrap on 80 when possible. > > >+ $self->do("IF EXISTS (SELECT name FROM sys.procedures WHERE name = N'sp_alter_primarykey' and type = 'P') > > > > You're telling me that there's no way to do this with normal SQL? > > Yep, unless someone knows a trick I haven't see, always possible. This seems to indicate it works just like other DBs: http://msdn.microsoft.com/en-us/library/ms174123.aspx > > Instead of copying all of the code, you should be calling > > SUPER::bz_add_field_tables and then doing your custom piece afterward. Also, > > your function name should start with bz_ or _bz_. > > Will review, there was a reason at one point, might have been more relevant > before I ended up killing all the foreign keys in favor of triggers. Okay. > > >+sub get_random_name() > > > > This already exists in Bugzilla::Util as generate_random_password. > > Again, I think there was a reason, will review. Without looking at the code, it might be that generate_random_password can use numbers and you don't want them in the name? In any case, if you keep this sub as a utility locally in the module, its name should be prefixed with an underscore. > > >+sub adjust_statement { > > Please factor this code out into a separate function if you're going to share > > it with Oracle. > > Not sure what your asking, the whole function?, the mssql specific items? The part that splits the string and parses it in parts. If you need to do something to each part, you can give the new function a callback that tests each part. > > >+ # Look for a LIMIT clause > > >+ ($limit) = ($nonstring =~ m(/\* LIMIT (\d*) \*/)o); > > > > Why do you essentially have this code twice? > > Is in Oracle.pm I think perhaps we should audit that code. Perhaps ask Xiaoou why he put it there. > > Also, I think you don't need $has_from, $has_select, or probably this entire > > loop. > > > > In Oracle.pm Yes, but it uses those. > > Also, you can't just paste a WHERE clause on some SQL--it's not valid at the > > end of every statement. And what if we start using UNION? > > Again in Oracle.pm Seriously? Wow...I thought Oracle did all of this inside the loop. > > >+ my ($before_where, $after_where) = split /\bWHERE\b/i,$new_sql; > > > > What if it has the word WHERE in a literal string? (Like, what if I search > > for "WHERE" on query.cgi?) > > One hopes that your using placeholders which haven't been populated at this > point. Beyond that no idea, see Oracle.pm We're not using placeholders in Search.pm. > > >+ if (defined($offset)) { > > >+ if ($new_sql =~ /(.*\s+)FROM(\s+.*)/i) { > > > > And what if it has the word FROM in a literal string? > > ditto Oracle.pm Wow, you are correct. It makes no sense, let's see if you can fix it better for MS-SQL. > > Why are you doing it before FROM? We need to limit after GROUP BY. > > Because we are wrapped in a () need to name the result set for mssql to see it > as valid, randomizing a name here lets sql keep trucking along. Why not just wrap the entire query and do a SELECT *? > > Also, why are you doing this here instead of in sql_interval? > > If I recall because we have to factor in an interval of a interval or some such > thing, I believe this occurs in whine.pl or collectstats.pl (or what ever it > was called) So if it could be cleaned up there... then we can I suppose. Yeah, I'd way rather see this in sql_interval. If there's some code that's not using sql_interval, it should be. > > >+ $new_sql =~ s/SUBSTRING$(.*) FROM (\d+) FOR (\d+)$/SUBSTRING($1, $2, $3)/igo; > > > > MS-SQL doesn't support the ANSI syntax for that? > Nope, we discussed this before, also in relation to the 3rd param being > required, which should be, else you are relying on an implicit rather than > explicit which can be confusing at times. Lame. Yeah, I do seem to recall we discussed it. Anyhow, you'll have to fix this inside of the parts loop, not outside of it. > An I don't feel you were harsh at all, quite enjoyed reading the feedback. > Thanks for you effort on this and Bugzilla as a whole. I'm glad. :-) Hey, you're welcome. :-)

Max Kanat-Alexander

Comment 19

•

15 years ago

(In reply to comment #17) > Oracle uses this Bugzilla::DB::Oracle::_fix_hashref($ref); in most of the > functions, I dropped it's usage in MSSQL, should be fine to create > Bugzilla::DB::StatementModifier by changing it to > Bugzilla::DB::_fix_hashref($ref) and just return for mssql. Or alternatively > perhaps put it in Util.pm as it could be used for non-sql operations. Make it a > constant to to see if we need to use it or not. Hmm. Yeah, tricky, because it's being used in two classes. There are some Perl OO tricks you could do to handle it....create another class called Bugzilla::DB::HashrefFixer, and then do some clever stuff with @ISA in Oracle.pm, but I'm not sure that's a good idea. I don't know, let me know what you come up with. Don't put it in Util.pm, though, and I'd prefer it wasn't in Bugzilla::DB directly, also.

Michael Thomas (Mockodin)

Reporter

•

15 years ago

Depends on: 557929

Max Kanat-Alexander

Comment 25

•

15 years ago

(In reply to comment #20) > So on the CLR import code (the really big chuck of random looking text) what > shall we do.. create /bugzilla/Bugzilla/DB/Mssql/CLR.inc ? > > We can then read the file during install or as needed? That sounds fine. Or you can put it in contrib with the rest of the CLR stuff.

Max Kanat-Alexander

Comment 26

•

15 years ago

(In reply to comment #22) > > Does need to be fixed. Should also be fixed in Oracle.pm by Xiaoou, but > > that's a separate issue. > > We'll need to work with Xiaoou to move this all to > Bugzilla::DB::StatementModifier anyhow so addressable then.. Okay, although that has to happen before this patch can pass review. > If you set up a System DSN, it kinda is, but your just moving where you > manually selected at that point. And yes you can have multiple version of the > driver installed one for e SQL 2005, for SQL 2008, etc.. Updates to a version > are left named the base name, so doesn't update very often. Constant would > work, maybe a prompt on install and store in data/params would be best though? So you're telling me there's no way to just say "{SQL Server}" without a version number and have it use the latest one available? > > Okay, thanks. So I assume that's what lets us have multiple selects going at > > once on one connection? > Yes, works well, though like the normal multiple result set warnings apply, I'm not aware of any problems with multiple result sets in other DBs. > Reasonable thought, makes for even better identification. Might still recommend > concatenating $0 and bugzilla ('Bugzilla '.$0 on the off chance another app has > the same file name. $0 is usually the full path to the current file, so you should be OK, but sure, "Bugzilla: $0" would be fine. > > > > What are those for? Also, why isn't LongReadLen a constant? > > odbc_default_bind_type => '-9' == SQLWCHAR allows for NVARCHAR Okay. Could you put a comment next to it? > LongReadLen sure I can make it a constant. Okay. And you're sure that you need LongReadLen for MS-SQL? > > Okay. Have those patches been sent to the maintainer of those modules? > > If I recall didn't need patches. Was just an issue of compiling on the server, Okay, in that case you should contact the maintainer of the theoryx5 PPM repository and ask him to package the modules for you, and if he says there's a problem, assist him in resolving the issues. > Might I suggest rather than just column to all for any of? I'm sorry, that sentence doesn't make sense to me. > Sometimes you might > just want the return sorted internally instead of based on another field. Not > much different than a subselect with a order by. This isn't an appropriate place to discuss this feature, which will happen in another bug. > > Perhaps you should put a version number into the CLR or include some sort of > > date about it in the database, so we could check if we have to update it. > > That I can do, in fact had one and pulled it out when I submitted the code. Ah > well :-) Ah, sounds good then. :-) > > This seems to indicate it works just like other DBs: > > > > http://msdn.microsoft.com/en-us/library/ms174123.aspx > > If I recall try it, and you'll get fun errors. Might have just given up late at > night, I'll review. I know I did some investigating before hand. Okay. If you can't get it to work with normal SQL, record the errors you're getting. If you need help, I can help research solutions or make suggestions. > > Why not just wrap the entire query and do a SELECT *? > > It is. In fact, I think, two of them Sort DESC TOP then Sort ASC TOP to give > the > effect of limit x, y What it looks like you're actually doing is splitting on the existing FROM, though...perhaps I'm incorrect? > OK, I'll look at altering whine, it is using sql_interval but like this > > $dbh-sql_interval(blah..) . ' - ' . $dbh-sql_interval(blah..) That's fine...are you saying the problem is that MS-SQL cannot do math with the interval formats you've created? > > Lame. Yeah, I do seem to recall we discussed it. Anyhow, you'll have to fix > > this inside of the parts loop, not outside of it. > > Think the issue was it splitting it up oddly, perhaps not I'll see if it works. Yes, it will get split up oddly, but you have to work around that.

Max Kanat-Alexander

•

15 years ago

(In reply to comment #27) > Also, before I forget--if the utf8 parameter is on in Bugzilla, you need to > set odbc_utf8_on in the driver: > http://search.cpan.org/dist/DBD-ODBC/ODBC.pm#odbc_utf8_on > > Or alternately, you may need to check if odbc_has_unicode is enabled and die > if not. From that page: >When building DBD::ODBC on Windows ($^O eq 'MSWin32') the WITH_UNICODE macro >is automatically added >Do not confuse this with DBD::ODBC's unicode support. The odbc_utf8_on >attribute only applies to non-unicode enabled builds of DBD::ODBC. Taking those togeather, it means not required under Windows.

Bill Barry

Comment 31

•

15 years ago

(In reply to comment #23) > > > > > + CREATE ASSEMBLY [Bugzilla.MSSQL] > > > + AUTHORIZATION [dbo] > > > + FROM 0x4D5A9000030... > > This varbinary constant here is the actual bits of the assembly, isn't it? > > Yes I would assume you shouldn't need some sort or other file to read, just use the dll itself. There's gotta be a short bit of perl that can read a file and store it as a hex string. > > > +public partial class UserDefinedFunctions > > > > This class is not partial, it is fully defined. This class should be in a > > Bugzilla.Mssql namespace. This class should be marked static. > > It being marked partial is based on the Visual Studio defaults, is there a > reason it should be marked otherwise? Ie is it broken as partial or gain > something being marked full? No performance reason or brokenness, just the better to be explicit logic rearing its head again. The word partial being there in the diff had me looking for a second half of the class. > > > + public static SqlInt32 INSTR([SqlFacet(MaxSize = -1)]SqlString find, [SqlFacet(MaxSize = -1)]SqlString inthis, SqlBoolean CaseSensitive) > > > > Function name should be InStr > > parameter inthis should be inThis > > parameter CaseSensitive should be caseSensitive > > OK, though I have never understood the mid word caps without capping the > first.. http://msdn.microsoft.com/en-us/library/ms229043.aspx Seems like it is arbitrary to me, but this was the decision made and it is largely followed. see also: http://msdn.microsoft.com/en-us/library/ms229002.aspx > > That said, I think there are some bugs here: > > I doubt that %H, %h, %I and %k are supposed to return the same value (similarly > > U,u,V,v, and X,x,Y). > > Is %f correct or should it also be PadLeft? > > You have an if for capital S and then do a replace with lowercase s. > > Could a test be written to ensure the correctness of this function? > > Are there l10n issues here? > > It's been tested here, in reality only a subset of the items are used by > Bugzilla. Most of this was an attempt to clone the MySQL function of the same > name. I can review again, wrote this about... a year ago :-) fuzzy now. Perhaps this assembly should be spun off to its own project, not under the stewardship of Bugzilla. I am certain it has usefulness outside of Bugzilla. Something like ???.SqlServer.MysqlExtensions (not concerned about this now though, maybe someday). > > These should be public to prevent a future compiler from obfuciating them. > > Well except then they can be altered externally. Is there another way? And > there is little point to obfuscating this code given its open source. Altered externally? What do you mean? I was talking about obfuscating as part of some future compiler optimizations that might come up. Object names are not required to remain in the compiled assembly if they are not publically accessable per the spec IIRC. Some future compiler could very well notice that they are not public and simply use their integer values instead of the enums. > > > + private static int WeekOfYear(SqlDateTime SqlDate) > > > + { > > > + if (SqlDate.IsNull) return 0; > > > + > > > + DateTime date = SqlDate.Value; > > > + > > > + System.Globalization.CultureInfo ci = System.Threading.Thread.CurrentThread.CurrentCulture; > > > + System.Globalization.Calendar cal = ci.Calendar; > > > + System.Globalization.CalendarWeekRule cwr = ci.DateTimeFormat.CalendarWeekRule; > > > + DayOfWeek fdow = ci.DateTimeFormat.FirstDayOfWeek; > > > + return cal.GetWeekOfYear(date, cwr, fdow); > > > + } > > > > parameter being passed in is DateTime, not SqlDateTime (makes first if and > > assignment unnecessary) > > if syntax inconsistant again > > using CurrentThread.CurrentCulture here, is this ok in sql app domain context > > (I think so but I am not certain)? > > You use the current culture here but previously you used InvariantCulture for > > case sensitivity. Does that present an internationalization issue (how does > > mysql do case sensitivity particularly for accented characters? does it vary > > per culture and if so should this or should it do it however mssql does case > > sensitivity when it is configured as insensitive, if they are different?)? > > Mmm, Can run some tests see if it is an issue? Or provide some case example to > test? I cobbled this together I think it is OK, but as commonly is I'm testing > many thing from my situation. I think it might be OK too, but I wasn't certain and this is an area where I don't feel comfortable enough to say everything is fine. As a developer my weakest area overall is i18n. I'm happy to blame it on US culture, that for a pretty long time we never gave any thought to writing code that would be used in other countries. Fortunately this is starting to change, but it is still a very foreign concept to many devs I have met. > >this keyword is not necessary > Can you explain further, not really sure what you are referring too. this.member can almost always just be retrieved by member. It is a convention choice more than anything. I prefer in my code to write as little cruft as possible. > >Read is not a compliment to Write > Can you explain further? http://msdn.microsoft.com/en-us/library/microsoft.sqlserver.server.ibinaryserialize_members%28v=VS.90%29.aspx Pretty sure that the following test would fail: var concat = new GROUP_CONCAT(); concat.Init(); concat.Accumulate(new SqlString("1"),new SqlString(", "), new SqlString("ASC")); concat.Accumulate(new SqlString("2"),new SqlString(", "), new SqlString("ASC")); concat.Accumulate(new SqlString("3"),new SqlString(", "), new SqlString("ASC")); var concat2 = new GROUP_CONCAT(); concat2.Init(); concat2.Accumulate(new SqlString("4"),new SqlString(", "), new SqlString("ASC")); concat2.Accumulate(new SqlString("5"),new SqlString(", "), new SqlString("ASC")); concat2.Accumulate(new SqlString("6"),new SqlString(", "), new SqlString("ASC")); concat.Merge(concat2); var ms = new MemoryStream(); using(var w = new BinaryWriter(ms)) { concat.Write(w); } ms.Seek(0); var concat3 = new Group_CONCAT(); using(var r = new BinaryReader(ms)) { concat3.Read(r); } var ms2 = new MemoryStream(); using(var w2 = new BinaryWriter(ms)) { concat3.Write(ms2); } var ms1Array = ms.ToArray(); var ms2Array = ms2.ToArray(); Debug.Assert(concat.Terminate().Value == concat3.Terminate().Value); Debug.Assert(ms1Array.Length == ms2Array.Length); for(int i=0;i<ms1Array.Length;++i) { Debug.Assert(ms1Array[i]==ms2Array[i]); } > > NaturalSortComparer.cs > > ====================== > > Assuming this code is correct (not even going to bother going there at least > > yet) my most significant issue is that the variable names are not very > > explicit. In my company we use _camelCase for fields, camelCase for method > > variables and parameters and PascalCase for just about everything visible > > outside of the class. > > These classes should also be in a Bugzilla.Mssql namespace and they should be > > in separate files per class (at least according to C# conventions that I know > > of, perhaps the bugzilla devs do not care to follow this). > > > > I am finding myself questioning what the whole point of this class is. > > What advantages does it provide that Bugzilla might need that will permit it to > > waste so much time (parsing strings) as part of the sql execution? > > Used in GroupConcat for sorting, without if you get 'human' unexpected results > eg sort 1,2,30,40,6,7,8,9,10,20,50,60,3,4,5, > you get 1,10,2,20,3,30,4,40 or something along those lines > What should imo return is > 1,2,3,4,5,6,7,8,9,10,20,30,40,50,60 > > That is what it does. What I meant was, what advantages does it have over something far simpler (easier to review, likely significantly faster) like the code below. public class NumericComparer : IComparer<string>, IComparer { public int Compare(string x, string y) { if(string.IsNullOrEmpty(x) || string.IsNullOrEmpty(y)) { return string.Compare(x, y); } double numX, numY; if (double.TryParse(x, out numX) && double.TryParse(y, out numY)) { return numX.CompareTo(numY); } return string.Compare(x, y); } public int Compare(object x, object y) { return Compare(x as string, y as string); } } > Thanks for reviewing! I'll see about applying changes in the next few days. You are welcome, Thanks for not taking it too harshly, it's my first public review.

Michael Thomas (Mockodin)

Reporter

Comment 32

•

15 years ago

(In reply to comment #31) > (In reply to comment #23) > > > > > > > + CREATE ASSEMBLY [Bugzilla.MSSQL] > > > > + AUTHORIZATION [dbo] > > > > + FROM 0x4D5A9000030... > > > This varbinary constant here is the actual bits of the assembly, isn't it? > > > > Yes > I would assume you shouldn't need some sort or other file to read, just use the > dll itself. > > There's gotta be a short bit of perl that can read a file and store it as a hex Yeah probably, but then, is it necessary vs. us releasing a preconfigured file. On obfuscation, I guess to me it boils down to, is it broken, is obfuscation even an issue if it changes the name? I don't think it is. Being what it is, it won't break anything. > I think it might be OK too, but I wasn't certain and this is an area where I > don't feel comfortable enough to say everything is fine. As a developer my > weakest area overall is i18n. I'm happy to blame it on US culture, that for a > pretty long time we never gave any thought to writing code that would be used > in other countries. Fortunately this is starting to change, but it is still a > very foreign concept to many devs I have met. Same here, but in this case, baring using tool to guess the incoming encoding, and setting culture to suite you end up having to accept it might not be perfect. I use a adapted guess encoding script found here http://www.codeproject.com/KB/recipes/DetectEncoding.aspx which might give us what we want but it probably unnecessary and would likely be high overhead. I use it for db conversions, mysql utf8 to mssql UCS-2 (a UTF-16 variant) > > >this keyword is not necessary > > Can you explain further, not really sure what you are referring too. > > this.member can almost always just be retrieved by member. It is a convention > choice more than anything. I prefer in my code to write as little cruft as > possible. Ah OK, matters not to me. We can alter it. > > >Read is not a compliment to Write > > Can you explain further? > > http://msdn.microsoft.com/en-us/library/microsoft.sqlserver.server.ibinaryserialize_members%28v=VS.90%29.aspx > > Pretty sure that the following test would fail: > > var concat = new GROUP_CONCAT(); > concat.Init(); > concat.Accumulate(new SqlString("1"),new SqlString(", "), new > SqlString("ASC")); > concat.Accumulate(new SqlString("2"),new SqlString(", "), new > SqlString("ASC")); > concat.Accumulate(new SqlString("3"),new SqlString(", "), new > SqlString("ASC")); > var concat2 = new GROUP_CONCAT(); > concat2.Init(); > concat2.Accumulate(new SqlString("4"),new SqlString(", "), new > SqlString("ASC")); > concat2.Accumulate(new SqlString("5"),new SqlString(", "), new > SqlString("ASC")); > concat2.Accumulate(new SqlString("6"),new SqlString(", "), new > SqlString("ASC")); > concat.Merge(concat2); > > var ms = new MemoryStream(); > using(var w = new BinaryWriter(ms)) { > concat.Write(w); > } > ms.Seek(0); > var concat3 = new Group_CONCAT(); > using(var r = new BinaryReader(ms)) { > concat3.Read(r); > } > var ms2 = new MemoryStream(); > using(var w2 = new BinaryWriter(ms)) { > concat3.Write(ms2); > } > var ms1Array = ms.ToArray(); > var ms2Array = ms2.ToArray(); > > Debug.Assert(concat.Terminate().Value == concat3.Terminate().Value); > Debug.Assert(ms1Array.Length == ms2Array.Length); > for(int i=0;i<ms1Array.Length;++i) { > Debug.Assert(ms1Array[i]==ms2Array[i]); > } I didn't test above, but compiled and running a very basic test of what I have, similar to your example above in sql see below: --SQL Snip DECLARE @tab TABLE (test int, test2 nvarchar(1)) DECLARE @var int SET @var=0 WHILE @var < 10 BEGIN INSERT INTO @tab SELECT @var, CASE WHEN @var < 6 THEN 'A' ELSE 'B' END SET @var = @var + 1 END SELECT dbo.GROUP_CONCAT(test, ',', 'ASC') FROM @tab group by test2 -- Result --Row 1 0,1,2,3,4,5 --Row 2 6,7,8,9 --End Snip > > > NaturalSortComparer.cs > What I meant was, what advantages does it have over something far simpler > (easier to review, likely significantly faster) like the code below. > > public class NumericComparer : IComparer<string>, IComparer { > public int Compare(string x, string y) { > if(string.IsNullOrEmpty(x) || string.IsNullOrEmpty(y)) { > return string.Compare(x, y); > } > double numX, numY; > if (double.TryParse(x, out numX) && double.TryParse(y, out numY)) { > return numX.CompareTo(numY); > } > return string.Compare(x, y); > } > public int Compare(object x, object y) { > return Compare(x as string, y as string); > } > } Can take into account items like Roman Numerals etc... if that matters.. maybe not much. Depends on what we want really, how far we want to go. Do we do only enough? > it's my first public review. You did good! If you disagree with me anything, stick to your guns if needed, MKanat or someone else can always arbitrate. :-)

Max Kanat-Alexander

Comment 33

•

15 years ago

(In reply to comment #29) > Correct, the drivers aren't called "SQL Server Native Client" they are called > with the major version number. > ie SQL Server Native Client 9.0 (2005) > SQL Server Native Client 10.0 (2008) Wow. That is utterly lame. Okay, here's what we should do: Allow people to put a space in $db_driver, for MS-SQL. The item after the space will be the version number. It will default to 10.0 if not specified. This will be only for MS-SQL. > > Okay. And you're sure that you need LongReadLen for MS-SQL? > > Had some truncation issues originally with out. None since even on a very very > large file or nvarchar return. Was the commonly recommended solution for the > truncation warnings I was receiving, believe in a MS article someplace as well. Okay, that sounds fine then. > On Where looks like, it works however the same for oracle, can't take credit > for how it works, beyond some modifications to mssqlize it where it differed > from Oracle. It was also a commonly recommended, see google search, method to > achieve LIMIT functionality with mssql. Okay. If it works, I'll let it slide, for now. > I'm doing a bit more, casting to insure an INT etc.. but that is that's the > basic result. If the datetime field or value we are adding or subtracting from > were passed to the function I could completely forgo the REGEX eliminating the > issue. I wonder that it wasn't built that way to begin with. It wasn't built that way to begin with because this is sql_interval, not sql_date_add or sql_date_subtract or sql_date_math or something like that. Is it possible to create a custom operator in MS-SQL for this? That is, you could return some data type that could be added or subtracted with a datetime...? sql_interval is used all over the code; I don't really want to change them all to sql_date_math, which would really get complex.

Max Kanat-Alexander

Comment 34

•

15 years ago

For these things that you're doing a global regex on the SQL with, one option is to put some delimiters around them that you think will never, ever appear in a Search.pm query, and then you could parse them out that way.

Max Kanat-Alexander

•

15 years ago

Also of note VS 2010, it now generates a sql deploy file as well, which includes the dll and associated files converted to a binary format. I have updated my development files for reading in and converting the dll for deploy as well. So a little more work on that side then I'll get that patch file up as well.

Max Kanat-Alexander

Comment 44

•

15 years ago

Okay, although I don't know much about what a lot of that means, it sounds positive. :-) Yeah, actually file a separate bug for the CLR code and attach the patch there. That way we can keep the reviews separate.

Michael Thomas (Mockodin)

Reporter

Updated

•

15 years ago

Blocks: 565720

Michael Thomas (Mockodin)

Reporter

Updated

•

15 years ago

No longer blocks: 565720

Depends on: 565720

Jasmin Sehic

Assignee

•

13 years ago

I will go through the comments here and apply all suggestions where applicable if they haven't been done already and post another patch update.

c1541

Comment 49

•

13 years ago

Why the INSTR function? Does the builtin CHARINDEX not do everything? http://msdn.microsoft.com/en-us/library/ms186323.aspx

Michael Thomas (Mockodin)

Reporter

•

13 years ago

Attached patch v9 (obsolete) — Details — Splinter Review

Attachment #559979 - Attachment is obsolete: true

Attachment #562295 - Flags: review?(mkanat)

Attachment #559979 - Flags: review?(mkanat)

Jasmin Sehic

Assignee

Comment 56

•

13 years ago

(In reply to Max Kanat-Alexander from comment #14) > Comment on attachment 436376 [details] [diff] [review] [diff] [details] [review] > v7 > > >=== added file Bugzilla/DB/Mssql.pm > >+# The Initial Developer of the Original Code is Netscape Communications > >+# Corporation. Portions created by Netscape are > >+# Copyright (C) 1998 Netscape Communications Corporation. All > >+# Rights Reserved. > > Untrue. > Fixed. > >+=head1 NAME > >+ > >+Bugzilla::DB::Mssql - Bugzilla database compatibility layer for MSSQL > > I know that a lot of other modules have this stuff up top, but could you > put the POD at the bottom? I've decided that's the best standard for us. > Done. > >+my %locks; > > Why do you have a package-level "my" variable? (Or any package-level > variable at all, for that matter.) > Removed. > >+# This is how many comments of MAX_COMMENT_LENGTH we expect on a single bug. > >+# In reality, you could have a LOT more comments than this, because > >+# MAX_COMMENT_LENGTH is big. > >+use constant MAX_COMMENTS => 50; > > You should not be overriding this in a driver. Why are you? Removed. > > >+use constant EMPTY_STRING => ''; > > You don't need to specify that. It is being used in some methods so I've left it there for now. > > >+use constant BLOB_TYPE => undef ; > > MS-SQL can bind blobs correctly without any special stuff to DBI? Looks like it. > > Nit: Semicolon right after "undef". > Fixed. > >+# This module extends the DB interface via inheritance > >+use base qw(Bugzilla::DB); > > Nit: This should be right under "package" and doesn't need the comment. > Done. > > >+ my $attrs = { odbc_default_bind_type => '-9', LongReadLen => 5000000 }; > > What are those for? Also, why isn't LongReadLen a constant? Mockdin explained what they are and I've made them into constants now. > > >+ # Needed by TheSchwartz > >+ $self->{private_bz_dsn} = \($dsn, $user, $pass, $attrs); > > Um... (a) You just tried to assign a list of references to a single hash > item (b) that's not the format of private_bz_dsn (c) does TheSchwartz even > support MS-SQL? TheSchwartz does not support ODBC and therefor MS-SQL. As it is using DBD and employs use of LIMIT and the fact it uses coalesce as one of the column names TheSchwartz module needs work to make it compatible. I have patched version that I got working which I will have to see what the TheSchwartz maintainers think about. The private_bz_dsn has been set up correctly now. > > >+ # all class local variables stored in DBI derived class needs to have > >+ # a prefix 'private_'. See DBI documentation. > >+ $self->{private_bz_tables_locked} = ""; > > That variable is actually obsolete and should be removed from all of > Bugzilla. Removed. > >+sub sql_regexp { > >+ my ($self, $expr, $pattern) = @_; > >+ > >+ $pattern = "'\\\\x00-\\\\x1f'" if ($pattern eq "'[[:cntrl:]]'"); > > I think that instead, you need to replace [:cntrl:] with the item there. Done. > >+sub sql_string_concat { > >+ my ($self, @params) = @_; > > >+ my @hack_for_hack; > > Nit: Should have a better variable name. > Better name was used :) > >+ foreach my $string (@params){ > >+ push @hack_for_hack, "cast($string as nvarchar(max))"; > > Nit: CAST should be capitalized. > Fixed. > >+# Currently Unsupportable, CONTAINS can not be used in GROUP BY > > Then let's not have the code here. But I don't see why it would be in a > GROUP BY. Added support for fulltext searching via the custom REGEXP function. > > >+sub sql_from_days { > >+ my ($self, $days) = @_; > >+ > >+ return " dateadd(dd,0, ( $days )) "; > > Why the spaces around the SQL? Fixed. > > >+sub sql_date_format { > >+ my ($self, $date, $format) = @_; > >+ > >+ $format = trim($format); > >+ $format = '%Y.%m.%d %H:%i:%s' unless $format; > >+ > >+ return " dbo.DATE_FORMAT($date,'$format') "; > > You just have a return here, and then there's a bunch of unreachable code > after it. Unreachable code removed. > >+sub LOCALTIMESTAMP { > > Why do you have this here? Removed. > > >+sub MOD { > > And this? Removed. > > >+sub bz_setup_database { > >+ my ($self) = @_; > >+ > >+ print "Regenerating CLR Functions for mssql..."; > > That should be an install_string. Also, it probably shouldn't happen every > time somebody runs checksetup.pl. If it has to happen every time, then there > shouldn't be a string printed. Removed printing. > > >+ $self->do("IF EXISTS (SELECT * FROM sys.objects WHERE object_id = OBJECT_ID(N'[dbo].[FROM_DAYS]') AND type in (N'FN', N'IF', N'TF', N'FS', N'FT')) > >+ DROP FUNCTION [dbo].[FROM_DAYS]"); > > Can you put this into a subroutine instead of repeating the same code 7 > times? Done. > > >+ $self->do(q| > > Usually we use q{ unless there's some reason not to. Done. > > >+ CREATE ASSEMBLY [Bugzilla.MSSQL] > >+ AUTHORIZATION [dbo] > >+ FROM > > [snip] > > Instead of having an enormous string inside of this file, which will be > loaded into memory for every httpd child, could the CLR code live in an > external file? Now loading the assembly dll file directly from contrib. > > >+ $self->do("CREATE FUNCTION [dbo].[FROM_DAYS](\@days [int]) > >+ RETURNS [datetime] WITH EXECUTE AS CALLER > >+ AS > >+ EXTERNAL NAME [Bugzilla.MSSQL].[UserDefinedFunctions].[FROM_DAYS] > >+ "); > > If you don't need to use ", then just use ' and you can remove the \ from > @days. (Same comment for all of these.) > > Also, our normal style would be to simply put the "); at the end of the > text. Done and done. > > These might be more readable using the heredoc syntax, though: > > $self->do(<<'END' > CREATE FUNCTION [dbo].[FROM_DAYS](@days [int]) > RETURNS [datetime] WITH EXECUTE AS CALLER > AS > EXTERNAL NAME [Bugzilla.MSSQL].[UserDefinedFunctions].[FROM_DAYS] > END > ); > > (You might want to use something different than "END" though.) > Done. > > >+ $self->do("IF EXISTS (SELECT name FROM sys.procedures WHERE name = N'sp_alter_column_default' and type = 'P') > > Nit: Very long line. Done. > > >+sub _bz_add_field_table { > > [snip] > > This is not code you should be overriding, and I don't see any reason why > you would have to. It looks just like a direct copy of the superclass's code. Removed. > > >+sub bz_add_field_tables { > >+ my ($self, $field) = @_; > >+ > >+ $self->_bz_add_field_table($field->name, > >+ $self->_bz_schema->FIELD_TABLE_SCHEMA, $field->type); > >+ if ($field->type == FIELD_TYPE_MULTI_SELECT) { > >+ my $ms_table = "bug_" . $field->name; > >+ $self->_bz_add_field_table($ms_table, > >+ $self->_bz_schema->MULTI_SELECT_VALUE_TABLE); > >+ > >+ $self->updatecreate_column_refs; > > Instead of copying all of the code, you should be calling > SUPER::bz_add_field_tables and then doing your custom piece afterward. Also, > your function name should start with bz_ or _bz_. > Removed as there was nothing custom. > > >+sub adjust_statement { > >+ my ($sql) = @_; > >+ > >+ # We can't just assume any occurrence of "''" in $sql is an empty > >+ # string, since "''" can occur inside a string literal as a way of > >+ # escaping a single "'" in the literal. Therefore we must be trickier... > >+ > >+ # split the statement into parts by single-quotes. The negative value > >+ # at the end to the split operator from dropping trailing empty strings > >+ # (e.g., when $sql ends in "''") > >+ my @parts = split /'/, $sql, -1; > > [snip] > > >+ # Look for a LIMIT clause > >+ ($limit) = ($nonstring =~ m(/\* LIMIT (\d*) \*/)o); > > Why do you essentially have this code twice? One is looking for a normal LIMIT and the other looks for LIMIT with offset. > Also, I think you don't need $has_from, $has_select, or probably this > entire loop. Those vars are not needed and have been removed. Loop is essential and has to remain. > > >+ if (defined($limit)) { > >+ if ($new_sql !~ /\bWHERE\b/) { > >+ $new_sql = $new_sql." WHERE 1=1"; > > Nit: Space around the period. Done. > >+ my ($before_where, $after_where) = split /\bWHERE\b/i,$new_sql; > > What if it has the word WHERE in a literal string? (Like, what if I search > for "WHERE" on query.cgi?) This was changed so literal strings cannot be affected now as replacements are done from the loop. > >+ $before_where =~ s/^(select +(distinct)?)/$1 top $limit /i; > > Nit: Use \s+ instead of a raw space with a plus character after it. Done. > > >+ $new_sql =~ s/CURRENT_DATE$?$?/CAST(GETDATE() as DATE)/igo; > > Oh, CURRENT_DATE in a raw string.... > > >+ $new_sql =~ s/LOCALTIMESTAMP$0$/GETDATE()/igo; # Might have to change this is we do more than (0) in the future > > And so on.... > > >+ $new_sql =~ s/NOW/GETDATE()/igo; > > And so on.... Done, done and done. > > >+ $new_sql =~ s/$?CAST\((\b[\w\.]+\(?$?) as (\w+$?$?)\) (\+|\-) \/\*INTERVAL (\d+|\?) (\b\w+\b)\*\/\)? (\+|\-) \/\*INTERVAL (\d+|\?) (\b\w+\b)\*\//CAST(DATEADD($5, $3$4, DATEADD($8, $6CAST($7 as int), $1)) as $2 )/; > > This is pretty hard to read. Could you use /x and make it more readable? > Done. Hopefully now it is much clearer what is actually happening. > > > >+sub db_lock { > > What's this for? Removed. > > >+# Should really live in it's own .pm file... > > Nit: its > > (it's is a contraction, "its" is a possessive pronoun like his or hers, > which also don't have apostrophes in them.) Corrected. > > >+# DNI function override for MSSQL sql adjustments > > DNI? (Is that supposed to be DBI?) > Corrected. > >+package Bugzilla::DB::Mssql::st; > > [snip] > > And these should go into their own class as well, that both MS-SQL and > Oracle can subclass. > > Let's call the package Bugzilla::DB::StatementModifier, and it can also > contain Bugzilla::DB::StatementModifier::st. Haven't done anything about this just yet as I am not quite sure about the scope of this change just yet and may need a bit of discussion before I am sure what needs to be done here. (In reply to Max Kanat-Alexander from comment #28) > (In reply to comment #24) > > MKanat mentioned moving items like > > > > >$new_sql =~ s/CURRENT_DATE$?$?/CAST(GETDATE() as DATE)/igo; > > > > inside the @parts loop, however, does running the regex mulitple times (once > > per loop) make more since than once at the end as it is now? > > Yes, because you *must* do replacements inside the loop, otherwise you're > also modifying strings that people are searching for or otherwise using in > SQL. This was done. (In reply to Max Kanat-Alexander from comment #33) > (In reply to comment #29) > > Correct, the drivers aren't called "SQL Server Native Client" they are called > > with the major version number. > > ie SQL Server Native Client 9.0 (2005) > > SQL Server Native Client 10.0 (2008) > > Wow. That is utterly lame. > > Okay, here's what we should do: > > Allow people to put a space in $db_driver, for MS-SQL. The item after the > space will be the version number. It will default to 10.0 if not specified. > This will be only for MS-SQL. I have only made the version number a constant in the pm for now. Allowing a space to be added in $db_driver will require a number of changes in DB.pm. Suggest we open another bug for this. (In reply to Max Kanat-Alexander from comment #35) > > >When building DBD::ODBC on Windows ($^O eq 'MSWin32') the WITH_UNICODE macro >is automatically added > > > > >Do not confuse this with DBD::ODBC's unicode support. The odbc_utf8_on >attribute only applies to non-unicode enabled builds of DBD::ODBC. > > > > Taking those togeather, it means not required under Windows. > > Yes, but somebody may want to host Bugzilla on *nix with the MS-SQL server > on Windows, so we have to make this check--probably just assure that we have > odbc_has_unicode on. odbc_utf8 attribute has been added.

Max Kanat-Alexander

Comment 57

•

13 years ago

Wow, amazing! Thank you, Jasmin! I will get to reviewing this some time in the next few days. If I don't, please email me directly.

Jasmin Sehic

Assignee

Comment 58

•

13 years ago

Attached patch v10 (obsolete) — Details — Splinter Review

Few little tweaks and removed full-text support as this currently can't be done directly. Full-text support should be worked on later.

Attachment #562295 - Attachment is obsolete: true

Attachment #562687 - Flags: review?(mkanat)

Attachment #562295 - Flags: review?(mkanat)

Max Kanat-Alexander

Comment 59

•

13 years ago

Comment on attachment 562687 [details] [diff] [review] v10 Review of attachment 562687 [details] [diff] [review]: ----------------------------------------------------------------- This is wonderful! I'm really happy that you've taken on this work! This is looking much much better! I have a lot of comments on this patch, below, but I want you to know that it's totally normal to have this many comments on a patch of this size, and that I'm really really happy with the work you're doing here. :-) ::: C:/Bugzilla/MSSQL/Bugzilla/DB/Mssql.pm @@ +10,5 @@ > +# implied. See the License for the specific language governing > +# rights and limitations under the License. > +# > +# The Original Code is the Bugzilla Bug Tracking System. > +# This needs an Initial Developer section. @@ +13,5 @@ > +# The Original Code is the Bugzilla Bug Tracking System. > +# > +# Contributor(s): > +# Michael Thomas <mockodin@gmail.com> > +# Jasmin Sehic <jasmins@embedcard.com> Nit: Only two-space indent past Contributor(s) below. @@ +17,5 @@ > +# Jasmin Sehic <jasmins@embedcard.com> > + > +package Bugzilla::DB::Mssql; > +use base qw(Bugzilla::DB); > +use strict; use strict should go above use base. @@ +29,5 @@ > + > +use constant EMPTY_STRING => ''; > + > +# needed for correct BLOB binding in MSSQL > +use constant BLOB_TYPE => undef; Nit: Extra spaces before "undef" @@ +31,5 @@ > + > +# needed for correct BLOB binding in MSSQL > +use constant BLOB_TYPE => undef; > + > +# needed to avoid data truncation by increasing LongReadLen Is this for blobs or for other fields as well? @@ +32,5 @@ > +# needed for correct BLOB binding in MSSQL > +use constant BLOB_TYPE => undef; > + > +# needed to avoid data truncation by increasing LongReadLen > +use constant LONG_READ_LEN => 5000000; Nit: There's an extra space before => Also, Perl lets you insert _ anywhere into numbers for readability--for long numbers like this it would probably be best to do: 50_000_000 @@ +38,5 @@ > +# odbc_default_bind_type '-9' is SQLWCHAR which allows for NVARCHAR > +use constant ODBC_BIND_TYPE => '-9'; > + > +# SQL Server Native Client version default is SQL 2008 => 10.0 > +use constant CLI_VERSION => '10.0'; Okay. What should happen if users are using a different version? Perhaps put a comment here helping me and others to understand how this should be edited for various versions, and if any other work is required. @@ +69,5 @@ > + > +sub bz_explain { > + my ($self, $sql) = @_; > + # effectly does nothing but allow SQL to display > + return ''; You can actually just leave this empty--see Bugzilla::DB::Sqlite and copy what's there. @@ +76,5 @@ > +sub bz_last_key { > + my ($self) = @_; > + my ($last_insert_id) = $self->selectrow_array('SELECT @@IDENTITY'); > + return $last_insert_id; > +} Does the normal DBI last_insert_id work? If so, we don't need to override bz_last_key. @@ +86,5 @@ > +} > + > +sub sql_regexp { > + my ($self, $expr, $pattern) = @_; > + $pattern =~ s/\'\[\[\:cntrl\:\]\]\'/\'\\\\x00-\\\\x1f\'/igo; You don't need to escape the single-quote. You also don't need to escape the colon. It looks like you're replacing a character class with a literal, though: [[:cntrl:]] becomes: \\x00-\\x1f When really I would think what you want is for: [:cntrl:] to become: \\x00-\\x1f Also you always want to replace [:cntrl:], not just when it's the whole string. So you don't need to check for or replace the single-quotes, either, I would imagine. @@ +92,5 @@ > +} > + > +sub sql_not_regexp { > + my ($self, $expr, $pattern) = @_; > + $pattern =~ s/\'\[\[\:cntrl\:\]\]\'/\'\\\\x00-\\\\x1f\'/igo; Same comment there. Also might want to abstract this out into a separate method instead of duplicating the code. @@ +98,5 @@ > +} > + > +sub sql_limit { > + my ($self, $limit, $offset) = @_; > + if(defined $offset) { Nit: Space after "if" @@ +109,5 @@ > + my ($self, @params) = @_; > + my @concat_strings; > + foreach my $string (@params) { > + push @concat_strings, "CAST($string as nvarchar(max))"; > + } You could do this with a map instead: my @concat_strings = map { "CAST($_ AS NVARCHAR(MAX))" } @params; @@ +120,5 @@ > +} > + > +sub sql_from_days { > + my ($self, $days) = @_; > + return "dateadd(dd, 0, ($days))"; Generally we like to capitalize anything that's SQL, so I would go with DATEADD here. @@ +125,5 @@ > +} > + > +sub sql_to_days { > + my ($self, $date) = @_; > + return "datediff(dd, 0, $date)"; And DATEDIFF here. @@ +142,5 @@ > +} > + > +sub sql_iposition { > + my ($self, $fragment, $text) = @_; > + return "CAST(dbo.INSTR($fragment, $text, 1) AS NVARCHAR(MAX))"; iposition should return an integer, not an nvarchar. @@ +147,5 @@ > +} > + > +sub sql_position { > + my ($self, $fragment, $text) = @_; > + return "CAST(dbo.INSTR($fragment, $text, 0) AS NVARCHAR(MAX))"; Same there, should be an integer, not an nvarchar. @@ +154,5 @@ > +sub sql_group_by { > + my ($self, $needed_columns, $optional_columns) = @_; > + return ($optional_columns ? "GROUP BY $needed_columns, $optional_columns" : > + "GROUP BY $needed_columns"); > +} This is the same as the base sql_group_by, so this override isn't needed. @@ +161,5 @@ > + my ($self, $column, $separator, $sortorder) = @_; > + $column = trim($column); > + > + if(defined $separator) { > + $separator = "'$separator'" if $separator !~ /^'.*'$/; You should never quote things manually in Bugzilla's DB code, you should always use $dbh->quote (or in this case, $self->quote). However, presently if we pass in a separator, it should be quoted, so this code isn't necessary here at all. @@ +163,5 @@ > + > + if(defined $separator) { > + $separator = "'$separator'" if $separator !~ /^'.*'$/; > + } else { > + $separator = "" The separator should default to a comma and space if not specified--look at Bugzilla::DB::Pg for an example. @@ +168,5 @@ > + } > + > + if(defined $sortorder) { > + $sortorder = 'NTRL' unless $sortorder =~ /ASC|DESC/; > + $sortorder = "'$sortorder'" if $sortorder !~ /^'.*'$/; Same note there about quotes--you should always use $self->quote and never quote manually. @@ +185,5 @@ > + my ($self) = @_; > + > + # drop functions > + $self->_bz_drop_object("[dbo].[FROM_DAYS]", > + "N'FN', N'IF', N'TF', N'FS', N'FT'", Looks like this line is repeated. Perhaps what would be best would be to add a _bz_add_function($name, $code) that could do this drop before the function was created. @@ +267,5 @@ > + IF EXISTS ( > + SELECT name FROM sys.procedures > + WHERE name = N'sp_alter_column_default' and type = 'P' > + ) > + DROP PROC [sp_alter_column_default]"); Same note here, we should have a _bz_add_procedure. @@ +289,5 @@ > + exec('alter table '+@table+' drop constraint '+@constraint) > + END > + exec('alter table [dbo].['+@table+'] > + add default ('''+@value+''') for ['+@column+']') > + END My eyes start to glaze over when I read these, because I don't know the MS-SQL stored query language, or the MS-SQL information schema tables layout. Perhaps ask on the developers list for somebody who's familiar with this to take a look at it and give it an informal review? @@ +337,5 @@ > + push @{ $self->{schema}{$table}{FIELDS} }, > + "$table\_id" => { TYPE => 'MEDIUMSERIAL', > + NOTNULL => 1, PRIMARYKEY => 1, }; > + } > + } Ah, you should never access {schema} directly, you should always use the methods that Schema provides, and access it via _bz_real_schema or _bz_schema. Also, why is this being done? If we have to have a PK, can we simply add it after creating the table, during bz_add_table, instead? Also, you will have to modify all of the schema-modification functions to address that there is an "invisible" PK on the table, in the case where they add a PK. This seems like quite a bit of complexity. @@ +368,5 @@ > + > +# Add MSSQL assemblies from contrib > +sub _bz_add_assembly { > + my ($self, $name) = @_; > + my $file = "contrib/$name.dll"; You'll need to use one of the bz_locations() values for that directory name--it's not always consistent and you're not always guaranteed to be running in the directory where your .cgi file is. I'm guessing the libpath (or whatever it's called) would work? @@ +371,5 @@ > + my ($self, $name) = @_; > + my $file = "contrib/$name.dll"; > + > + # load entire CLR assembly from file > + open (FILE, $file) || die "Can't open $file: $!\n"; Ah, it's best to never use the two-argument form of open, and to always use the three-argument form. Also, it's best not to use a global "glob" like FILE there. Instead, you can do: open(my $fh, '<', $file) || die "$file: $!"; @@ +375,5 @@ > + open (FILE, $file) || die "Can't open $file: $!\n"; > + local $/; > + binmode FILE; > + my $assembly = <FILE>; > + close (FILE); The best way to do this is instead: my $assembly; binmode $assembly; { local $/; $assembly = <$fh>; } close($fh) || warn "$file: $!"; @@ +389,5 @@ > + WITH PERMISSION_SET = SAFE"); > +} > + > +# used by adjust statement to generate a unique column alias > +sub _get_random_name { Why aren't you using generate_random_password from Bugzilla::Util? @@ +390,5 @@ > +} > + > +# used by adjust statement to generate a unique column alias > +sub _get_random_name { > + my @chars=('a'..'z','A'..'Z'); Nit: Spaces around = @@ +393,5 @@ > +sub _get_random_name { > + my @chars=('a'..'z','A'..'Z'); > + my $random_string; > + foreach (1..22) { > + $random_string .= $chars[rand @chars]; Ah, never use Perl's rand in the Bugzilla codebase. (Sorry, this isn't documented anywhere, but it's now true.) Instead you want Bugzilla::RNG::irand, but you probably don't need this function at all, so we're probably fine there. @@ +413,5 @@ > + $?CAST\((\b[\w\.]+\(?$?)\s+as\s+(\w+$?$?)\) # date is $1 and $2 is type > + \s*(\+|\-)\s* # + or - is $3 > + \/\*INTERVAL\s*(\d+|\?)\s*(\b\w+\b)\*\/\)? # INTERVAL comment $4 & $5 > + \s*(\+|\-)\s* # + or - is $6 > + \/\*INTERVAL\s*(\d+|\?)\s*(\b\w+\b)\*\/ # INTERVAL comment $7 & $8 Wow, these are so much clearer now, thank you so much. :-) Instead of looking at nesting, why don't you just keep re-running the regex until you don't see INTERVAL in there anymore? @@ +427,5 @@ > + $?(\b[\w\.]+\(?$?) # date as $1 > + \s*(\+|\-)\s* # + or - is $2 > + \/\*INTERVAL\s+(\d+|\?)\s+(\b\w+\b)\*\/\)? # INTERVAL comment $3 & $4 > + \s*(\+|\-)\s* # + or - is $5 > + \/\*INTERVAL\s+(\d+|\?)\s+(\b\w+\b)\*\/ # INTERVAL comment $6 & $8 Same here. Also, why do we even care about CAST or not CAST? @@ +456,5 @@ > + > + # replace with MSSQL dateadd function instead of interval > + /DATEADD($4, $2CAST($3 AS int), $1) > + > + /xigo; You probably don't want "i" in there or in any of these interval regexes actually, because your INTERVAL should always be in caps, since that's what you returned from sql_interval. @@ +459,5 @@ > + > + /xigo; > + > + # Length replace > + $sql =~ s/\bLENGTH\b$(.*)$/LEN($1)/igo; You probably want to make that .*? instead of just .* @@ +462,5 @@ > + # Length replace > + $sql =~ s/\bLENGTH\b$(.*)$/LEN($1)/igo; > + > + # Substring replace > + $sql =~ s/SUBSTRING$(.*) FROM (\d+) FOR (\d+)$/SUBSTRING($1, $2, $3)/igo; Same for that .* there. @@ +465,5 @@ > + # Substring replace > + $sql =~ s/SUBSTRING$(.*) FROM (\d+) FOR (\d+)$/SUBSTRING($1, $2, $3)/igo; > + > + # Modulo replace > + $sql =~ s/\bMOD\b$\s*(.*)\s*,\s*(.*)\s*$/($1 % $2)/igo; Does that work on our other DBs, just using the % operator? If it does, we should just switch. @@ +470,5 @@ > + > + return $sql; > +} > + > +sub adjust_statement { A lot of this is shared with the Oracle driver, right? If so, most or all of this method should be moved up into Bugzilla::DB. @@ +565,5 @@ > + } > + return $new_sql; > +} > + > +sub do { And yeah, my comment about StatementModifier is still true, let's definitely still do that.

Attachment #562687 - Flags: review?(mkanat) → review-

Max Kanat-Alexander

Updated

•

13 years ago

Assignee: mockodin → jasmins

Jasmin Sehic

Assignee

Comment 60

•

13 years ago

Attached patch v11 — Details — Splinter Review

Attachment #562687 - Attachment is obsolete: true

Attachment #564013 - Flags: review?(mkanat)

Jasmin Sehic

Assignee

Assignee

Comment 72

•

12 years ago

I also have a patch for this but I need to write as simple SQL statement parser as I can in order to inject ORDER BY clause in SQL statements that specify LIMIT without the ORDER BY clause. In MSSQL 2012 you can now do LIMIT but ORDER BY clause must be present. This poses the same problem MKanat brought up with the current way of limiting results which doesn't work so well for nested SQL statements hence needing to parse entire SQL statement to ensure no literal strings are corrupted and nested SQL statements are supported.

Initial Patch for Bugzilla::DB::Mssql 16 years ago Michael Thomas (Mockodin) 25.75 KB, patch		Details \| Diff \| Splinter Review
Patch for Bugzilla::DB::Mssql 16 years ago Michael Thomas (Mockodin) 25.76 KB, patch		Details \| Diff \| Splinter Review
Patch for Bugzilla::DB::Mssql 16 years ago Michael Thomas (Mockodin) 20.91 KB, patch		Details \| Diff \| Splinter Review
Patch for Bugzilla::DB::Mssql 16 years ago Michael Thomas (Mockodin) 20.17 KB, patch		Details \| Diff \| Splinter Review
Patch v5 15 years ago Michael Thomas (Mockodin) 24.31 KB, patch		Details \| Diff \| Splinter Review
v6 15 years ago Michael Thomas (Mockodin) 51.39 KB, patch		Details \| Diff \| Splinter Review
v7 15 years ago Michael Thomas (Mockodin) 19.21 KB, patch		Details \| Diff \| Splinter Review
v7 15 years ago Michael Thomas (Mockodin) 89.65 KB, patch	mkanat : review-	Details \| Diff \| Splinter Review
v8 13 years ago Jasmin Sehic 55.57 KB, patch		Details \| Diff \| Splinter Review
v8 13 years ago Jasmin Sehic 55.58 KB, text/plain		Details
v8 13 years ago Jasmin Sehic 55.58 KB, patch		Details \| Diff \| Splinter Review
v9 13 years ago Jasmin Sehic 23.79 KB, patch		Details \| Diff \| Splinter Review
v10 13 years ago Jasmin Sehic 23.31 KB, patch	mkanat : review-	Details \| Diff \| Splinter Review
v11 13 years ago Jasmin Sehic 22.53 KB, patch	mkanat : review-	Details \| Diff \| Splinter Review