I don't believe the query does that at the moment.
db_query("UPDATE {pift_test}
  SET status = %d
  WHERE type = %d
    AND id IN (
      SELECT f.fid
      FROM {files} f
      LEFT JOIN {upload} u ON f.fid = u.fid
      LEFT JOIN {comment_upload} cu ON f.fid = cu.fid
      JOIN {project_issues} pi ON " . $clause . "
      JOIN {pift_project} p ON pi.pid = p.pid
      JOIN {project_release_nodes} r ON pi.rid = r.nid
      JOIN {node} n ON r.nid = n.nid
      JOIN {term_node} t ON (n.vid = t.vid AND t.tid IN (" . db_placeholders($api_versions, 'int') . "))
      WHERE pi.sid IN (" . db_placeholders($sids, 'int') . ")
    )
    AND status > %d
    AND last_tested < %d",
  array_merge(array(PIFT_STATUS_QUEUE, PIFT_TYPE_FILE), $api_versions, $sids,
    array(PIFT_STATUS_SENT, $retest_time)));
Comments
Comment #1
boombatower commented

Comment #2
boombatower commented
Using the following test code, I was able to determine that the query does in fact affect all patches on an issue, instead of just the last one.
I am not really sure how to fix this, since we cannot merge the node and comment tables. Otherwise, we would have to do a SELECT and then run the UPDATE inside a loop, which doesn't sound good.
Comment #3
boombatower commented
Current query with values:
Untested, but this should be what we are looking at:
Comment #4
damien tournoud commented
Let's just add the nid to pift_test, and be done with that:
Comment #6
webchick commented
Subscriiiiibe!
Comment #7
catch commented
Subscribing, I assume this is why patch re-testing is switched off at the moment.
Comment #8
rfay commented

Comment #9
jthorson commented.
Comment #10
jthorson commented
Similar historical issue located at #635334: Patches are not being re-tested automatically.
Comment #11
jthorson commented
Attached patch contains a new 'Select' query for validation purposes before implementing the actual update.
Note: This patch is dependent on #1348958-6: Add 'nid' column to pift_test landing first.
Comment #12
jthorson commented
Needs some optimization, but this appears to work:

Comment #13
jthorson commented
Oops. Wrong version. Getting late. :(
Comment #14
jthorson commented
Again (with a SELECT instead of an UPDATE).
Comment #15
jthorson commented
Edit: Had an issue with a named constant.
Comment #16
jthorson commented
New candidate for testing/validation, but hard-coded to only work once, and to trigger a very small number of retests (instead of flooding us with 1100+ tests, which is what would happen if we put this live without a careful launch strategy).
Comment #17
jthorson commented
Last patch was a non-starter.
This one works. :)
But because enabling this could totally overload the testbots for a few days, I've left some safety checks in place. To actually enable re-testing using this patch, comment out the $retest_time = 1; line near the beginning of pift.cron.inc and reinstate the line before it.
We'll need to come up with a plan for 'staged' retesting until we've cleared the backlog ... my thought was that enabling it for "drush pift-cron", but not for pift_cron(), would allow us to manually trigger batch retests as needed.
Comment #18
jthorson commented
Deployment Options:
1. Add a new drush command which accepts an argument 'X' and processes 'X' retests per execution, leaving the 'retest' variable disabled until the backlog is cleared ... manually queueing during slow periods, and setting it up as a Jenkins job once we know a reasonable value for 'X'.
2. Add the current 'queue depth' to the communication from PIFR to PIFT, and queue X retests based on queue-depth triggers.
TODO: Update the 'retest interval' options to include weeks/months.
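A minimal sketch of option 1's batching behaviour, in Python rather than the module's PHP, purely for illustration: requeue at most 'X' retests per invocation so the backlog drains without flooding the testbots. The function and its arguments are hypothetical, not PIFT or drush APIs.

```python
def process_retests(candidates, limit):
    """Requeue up to `limit` retest candidates; return the batch requeued.

    `candidates` stands in for whatever the real drush command would
    fetch (e.g. eligible RTBC tests); the actual requeue step is elided.
    """
    batch = candidates[:limit]
    for test in batch:
        pass  # the real command would mark each test as queued here
    return batch

# With a backlog of 5 and a limit of 2, the backlog drains over
# three runs (2 + 2 + 1) instead of all at once.
backlog = ["t1", "t2", "t3", "t4", "t5"]
print(process_retests(backlog, 2))  # ['t1', 't2']
```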
Comment #19
boombatower commented
Not 100% sure what we did originally when this feature was turned on. Can we simply have a drush command for retesting that we call on a different interval, say twice a day, and that only sends 30-40 tests until we are caught up? Having the retests run on a slower interval is probably better anyway, since they are lower priority.
Comment #20
jthorson commented
Yup ... that was the thought (Option 1).
The second thought was more for documentation's sake, and would probably be a longer-term feature ... I added it here so that I didn't forget it the next morning.
Comment #21
sun commented
Stale commented-out debug code?
Can you run an EXPLAIN on this query and post the results?
That is, because this query looks very expensive and potentially very long-running to me.
JOINs should always be explicit INNER JOIN or LEFT JOIN.
I have trouble understanding this comment. If I'm reading the query correctly, matching patches/tests are getting reset to "queued"?
1) All SQL statement keywords, such as UPDATE and SET should be all-uppercase.
2) The ORDER BY and LIMIT seem bogus here.
A cron callback should not trigger dsm().
Comment #22
jthorson commented
Updated for the D7 drupal.org upgrade ... the D7 Project* changes made the query look worse.
Will begin testing this tomorrow.
Comment #23
jthorson commented

Comment #24
jthorson commented
Updated version.
Comment #25
jthorson commented
Release candidate.
Comment #26
jthorson commented
Committed version.
Will be included in the next PIFT deploy, but the timeline for enabling the feature will depend on i) the number of outstanding RTBC retests, ii) confirming that the retest query doesn't negatively impact the drupal.org database, and iii) sorting out which option we use for actually triggering the retests.
Comment #27
jthorson commented
Close, but first tests on prod uncovered that the query needs a bit of refactoring.
Right now, it chooses the 'last passed file that has not already been sent' per RTBC issue, as opposed to the 'last passed file *if it* has not already been sent'.
Comment #28
jthorson commented
Okay, this looks better!
Comment #29
jthorson commented
Okay, did some more work on this, to try to see why it was negatively affecting drupal.org when I tried to enable it. I've now got pift-cron running only once per minute, and cleared out a test linking to a deleted file that was gumming things up ... and the PIFT logs look a lot more 'normal' now. There's a good chance that this query was trying to run twice concurrently.
The retest query itself looks like this:
With the EXPLAIN pastebinned at http://privatepaste.com/9b19b39c2b
So ... any mysql optimizers able to help improve this?
Comment #30
drumm commented
Add ORDER BY NULL to the subquery:
(SELECT MAX(pd2.test_id) AS maxid FROM pift_data pd2 GROUP BY nid ORDER BY NULL)
GROUP BY implies a sort, which isn't too relevant in a subquery.

Comment #31
drumm commented
If any of the WHERE conditions on pd can also be added to pd2 in the subquery, that will reduce the number of rows examined. Otherwise, the query is okay, since it is not run concurrently.
Comment #32
jthorson commented
The WHERE conditions need to be outside, as otherwise we would get the last patch that meets the criteria, instead of getting the last patch and THEN seeing if it meets the criteria.
In any case, I've got a patch that restructures this into an EFQ (EntityFieldQuery) to get the RTBC issues, and then uses PHP to filter out the last test. Will look at deploying it for testing today.
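The "filter out the last test per issue" step can be sketched as follows, in Python rather than the module's PHP, purely for illustration. The record fields (nid, test_id) are assumptions based on the pift_data columns discussed in this thread, not the actual patch code.

```python
def last_test_per_issue(tests):
    """Given test records as dicts with 'nid' and 'test_id' keys,
    keep only the record with the highest test_id for each issue nid,
    i.e. the most recent test per issue."""
    latest = {}
    for test in tests:
        nid = test["nid"]
        if nid not in latest or test["test_id"] > latest[nid]["test_id"]:
            latest[nid] = test
    return latest

tests = [
    {"nid": 10, "test_id": 101},
    {"nid": 10, "test_id": 105},  # later patch on the same issue
    {"nid": 22, "test_id": 103},
]
result = last_test_per_issue(tests)
print(sorted(t["test_id"] for t in result.values()))  # [103, 105]
```

Only the surviving records would then be checked against the retest criteria, matching the "get the last patch and THEN see if it meets the criteria" ordering described above.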
Comment #33
jthorson commented
The revamped patch (using an EFQ to get the issues, then loading all tests and sorting via PHP) is now live, via a new Jenkins job.
The job will requeue up to 5 RTBC issues every 15 minutes. There are in the neighborhood of 400 RTBC patches across all of core and contrib, but 2/3 of these are D6/contrib tests which complete testing within just a couple of minutes each, so they should not have a heavy impact on the queue. Core alone has 108 tests, which includes all of D6/D7/D8.