Closed Bug 1579448 Opened 6 years ago Closed 6 years ago

Multiple changes to NSPR tests, find the mimimum set of tests that currently work on all platforms, prepare to reenable CI in 2019.

Tracking

(Not tracked)

Status:

RESOLVED FIXED

Milestone:

4.25

People

(Reporter: KaiE, Assigned: KaiE)

References

Details

Attachments

(3 files, 6 obsolete files)

1579448-v1.patch 6 years ago Kai Engert [:KaiE:] 5.69 KB, patch		Details \| Diff \| Splinter Review
1579448-v2.patch 6 years ago Kai Engert [:KaiE:] 17.48 KB, patch		Details \| Diff \| Splinter Review
FreeBSD: build/run tests log 6 years ago Jan Beich 162.63 KB, text/plain		Details
1579448-v3.patch 6 years ago Kai Engert [:KaiE:] 13.07 KB, patch		Details \| Diff \| Splinter Review
nspr-runtest.patch 6 years ago Kai Engert [:KaiE:] 1.12 KB, patch		Details \| Diff \| Splinter Review
1579448-v4.patch 6 years ago Kai Engert [:KaiE:] 16.36 KB, patch		Details \| Diff \| Splinter Review
FreeBSD: build/run tests log 6 years ago Jan Beich 158.40 KB, text/plain		Details
1579448-v5.patch [checked in] 6 years ago Kai Engert [:KaiE:] 34.42 KB, patch	jbeich : review+	Details \| Diff \| Splinter Review
1579448-hide.patch 6 years ago Kai Engert [:KaiE:] 423 bytes, patch	KaiE : review? jbeich	Details \| Diff \| Splinter Review

Kai Engert [:KaiE:]

Assignee

Description

•

6 years ago

I've tried to run the NSPR tests today on Linux 64bit.

The multiwait test fails. I've gone back to NSPR 4.4 from 2 which is the oldest version that I can build on my current system. The multiwait test was already failing in 4.4 so I suggest we disable it.

One more test was failing: fdcach

I found that it has started to fail with NSPR 4.10.8, because of bug 573192 which had removed the stack based fd cache. We should remove the corresponding tests.

Jan, does that make sense to you?

What results do you get if you execute NSPR runtests.sh on your system, same or more failures?

Kai Engert [:KaiE:]

Assignee

Comment 1

•

6 years ago

Attached patch 1579448-v1.patch (obsolete) — Details — Splinter Review

Assignee: nobody → kaie

Attachment #9091046 - Flags: review?(jbeich)

Kai Engert [:KaiE:]

Assignee

Comment 4

•

6 years ago

There are two scripts to run the scripts, sh and pl.

Jan, which one do you usually use? I don't know why are there two. The README.TXT suggests that the .pl might be more powerful, and control the set of executed tests per platform, which is supposed to be defined in a file runtests.txt, but that txt file doesn't exist.

Kai Engert [:KaiE:]

Assignee

Comment 5

•

6 years ago

Attached patch 1579448-v2.patch (obsolete) — Details — Splinter Review

I'm trying to run the NSPR tests in taskcluster.
It uses hardcoded filenames in /tmp/ which we should avoid. I've changed the code to use files in the local directory instead.

There are multiple ways to run the tests:

a runtest.sh script
a runtest.pl script
a Makefile rule runtests

Because the Makefile attempts to run all tests, but the scripts only run a subset (because some tests are known to fail), we should change the Makefile target to execute one of the scripts. (It's best for me to run a make target from taskcluster).

I've chosen the .sh script, as that reports the failure correctly to the caller with an exit code, allowing the make target to detect the failure.

Attachment #9091046 - Attachment is obsolete: true

Attachment #9091046 - Flags: review?(jbeich)

Kai Engert [:KaiE:]

Assignee

Updated

•

6 years ago

Attachment #9091390 - Flags: review?(jbeich)

Kai Engert [:KaiE:]

Assignee

Updated

•

6 years ago

Summary: Adjust set of executed NSPR tests → NSPR tests shouldn't use /tmp; exclude failing multiwait; Make target runtests should call runtests.sh

Kai Engert [:KaiE:]

Assignee

Updated

•

6 years ago

Summary: NSPR tests shouldn't use /tmp; exclude failing multiwait; Make target runtests should call runtests.sh → NSPR tests shouldn't use /tmp; exclude failing multiwait; Make target runtests should call runtests.sh; remove FD stack testing

Kai Engert [:KaiE:]

Assignee

Updated

•

6 years ago

Blocks: 1579836

Jan Beich

Comment 6

•

6 years ago

Attached file FreeBSD: build/run tests log (obsolete) — Details

Changes after applying attachment 9091390 [details] [diff] [review]:

--- before
+++ after
@@ -2700,7 +2700,7 @@ dlltest			Passed
 dtoa			Passed
 errcodes			Passed
 exit			Passed
-fdcach			FAILED
+fdcach			Passed
 fileio			Passed
 foreign			Passed
 formattm			Passed
@@ -2729,7 +2729,6 @@ lockfile			Passed
 logfile			Passed
 logger			Passed
 many_cv			Passed
-multiwait			Passed
 nameshm1			Passed
 nblayer			Passed
 nonblock			Passed

Jan Beich

Updated

•

6 years ago

Attachment #9091422 - Attachment mime type: text/x-log → text/plain

Jan Beich

Comment 7

•

6 years ago

(In reply to Kai Engert (:kaie:) from comment #0)

The multiwait test fails. I've gone back to NSPR 4.4 from 2 which is the
oldest version that I can build on my current system. The multiwait test was
already failing in 4.4 so I suggest we disable it.

multiwait test passes on FreeBSD, even on 11.2. I can't test other BSDs atm.

One more test was failing: fdcach

Thanks for fixing it. Downstream Mikhail fixed and enabled a few more tests: forktest, nbconn, poll_err.
https://github.com/freebsd/freebsd-ports/blob/173ce3c7ca54/devel/nspr/files/patch-tests

(In reply to Kai Engert (:kaie:) from comment #4)

There are two scripts to run the scripts, sh and pl.

Only sh but also run library tests. I don't run tests regularly because it takes too long.
https://github.com/freebsd/freebsd-ports/blob/173ce3c7ca54/devel/nspr/Makefile#L40-L42

Jan Beich

Comment 8

•

6 years ago

•

Edited

Comment on attachment 9091390 [details] [diff] [review] 1579448-v2.patch Review of attachment 9091390 [details] [diff] [review]: ----------------------------------------------------------------- ::: pr/tests/foreign.c @@ +232,5 @@ > +#define TEMP_DIR "./tmp" > +#if defined(WIN32) > + mkdir(TEMP_DIR); > +#else > + mkdir(TEMP_DIR, 0700); Why not replace with PR_MkDir to avoid conditional? ::: pr/tests/sema.c @@ +8,5 @@ > > #include <stdio.h> > > +#define SEM_NAME1 "./tmp-foo.sem" > +#define SEM_NAME2 "./tmp-bar.sem" Omitting leading / (slash) may not work on _PR_HAVE_POSIX_SEMAPHORES platforms. See also _pr_ConvertSemName. https://pubs.opengroup.org/onlinepubs/9699919799/functions/sem_open.html https://docs.oracle.com/cd/E23823_01/html/816-5171/sem-open-3rt.html http://nixdoc.net/man-pages/HP-UX/man2/sem_open.2.html

BugBot [:suhaib / :marco/ :calixte]

Comment 9

•

6 years ago

The priority flag is not set for this bug.
:jcj, could you have a look please?

For more information, please visit auto_nag documentation.

Flags: needinfo?(jjones)

J.C. Jones [:jcj] (he/they)

Updated

•

6 years ago

Flags: needinfo?(jjones)

Priority: -- → P1

Kai Engert [:KaiE:]

Assignee

Comment 10

•

6 years ago

Attached patch 1579448-v3.patch (obsolete) — Details — Splinter Review

Attachment #9091390 - Attachment is obsolete: true

Attachment #9091390 - Flags: review?(jbeich)

Kai Engert [:KaiE:]

Assignee

Comment 11

•

6 years ago

Attached patch nspr-runtest.patch (obsolete) — Details — Splinter Review

Kai Engert [:KaiE:]

Assignee

Comment 12

•

6 years ago

Jan, thanks for your feedback and your suggestions. Using PR_Mkdir is great idea :)
I'm reverting the changes to semaphore naming.

I'd like to execute the NSPR tests as soon as possible, and while experimenting on win/mac/linux, unfortunately many tests are failing.

I didn't have time to investigate what's failing. I would like to enable all the tests that currently work as a baseline. Then, afterwards, we can try enabling the other tests one by one, and fix code as needed.

(FYI, several of my experiments can be found at https://treeherder.mozilla.org/#/jobs?repo=nss-try - some of the more recent ones print a debug log.)

I'm attach a new patch, that disables all the tests that currently fail on at least one platform.

Kai Engert [:KaiE:]

Assignee

Comment 13

•

6 years ago

Attached patch 1579448-v4.patch (obsolete) — Details — Splinter Review

Can you live with this?

Attachment #9105635 - Attachment is obsolete: true

Attachment #9105644 - Attachment is obsolete: true

Attachment #9105972 - Flags: review?(jbeich)

Kai Engert [:KaiE:]

Assignee

Comment 14

•

6 years ago

Windows behavior is a bit erratic. As soon as I disable one test, some other test starts to fail... I just disabled "parent" in addition. I wonder if there'a s problem with concurrency, if more than one test runs on the same machine.

Kai Engert [:KaiE:]

Assignee

Comment 15

•

6 years ago

Comment on attachment 9105972 [details] [diff] [review] 1579448-v4.patch yes, I think it's about races for the same port. I want to change the port numbers based on debug/opt and 32/64 bit target.

Attachment #9105972 - Flags: review?(jbeich)

Kai Engert [:KaiE:]

Assignee

Updated

•

6 years ago

Attachment #9105972 - Attachment is obsolete: true

Jan Beich

Comment 16

•

6 years ago

Comment on attachment 9105972 [details] [diff] [review] 1579448-v4.patch Review of attachment 9105972 [details] [diff] [review]: ----------------------------------------------------------------- ::: pr/tests/runtests.sh @@ +1,1 @@ > +#!/bin/bash On BSD systems Bash may not be installed by default. Default Bash location varies e.g., /usr/local/bin/bash on DragonFly, FreeBSD, OpenBSD; /usr/pkg/bin/bash on NetBSD. Both /usr/local and /usr/pkg are user-configurable via LOCALBASE variable (environment or make). Consider changing to: #!/usr/bin/env bash @@ +5,5 @@ > # file, You can obtain one at http://mozilla.org/MPL/2.0/. > > +if test -z $1 > +then > + echo "usage $0 <path-to-dist>" Missing : (colon) before "usage" and "$0". Also, can be simplified to : ${1:?"usage: $0 <path-to-dist>"} @@ +11,5 @@ > +fi > + > +pushd $1/lib > +ABS_LIB=`pwd` > +popd Why not keep POSIX compatibility? /bin/sh maybe faster than Bash e.g., on FreeBSD. cd $1/lib ABS_LIB=$PWD cd - https://pubs.opengroup.org/onlinepubs/9699919799/utilities/cd.html

Attachment #9105972 - Attachment is obsolete: false

Jan Beich

Comment 17

•

6 years ago

Attached file FreeBSD: build/run tests log — Details

pr/tests/parent sometimes fails on FreeBSD. Could be a race in the test or a bug in NSPR abstraction or a kernel bug. Compared to previous testing my CPU was 75% (or less) busy i.e., 6/8 logical cores dedicated to package building.

I've attached the log from a successful run with changes proposed in comment 16.

Attachment #9091422 - Attachment is obsolete: true

Kai Engert [:KaiE:]

Assignee

Comment 18

•

6 years ago

(In reply to Jan Beich from comment #16)

Consider changing to: #!/usr/bin/env bash

well, let's stay with /bin/sh and use your suggestion below.

Missing : (colon) before "usage" and "$0". Also, can be simplified to

: ${1:?"usage: $0 <path-to-dist>"}

I think that's difficult to read. I'd like to keep my simpler version.

cd $1/lib
ABS_LIB=$PWD
cd -

ok, with that we can keep /bin/sh

Kai Engert [:KaiE:]

Assignee

Comment 19

•

6 years ago

Attached patch 1579448-v5.patch [checked in] — Details — Splinter Review

This patch changes the port numbers, and names of semaphores, based on debug/opt and 32/64 bit, and should be more resistent to parallel execution of tests on the same machine.

Attachment #9105972 - Attachment is obsolete: true

Attachment #9105989 - Flags: review?(jbeich)

Kai Engert [:KaiE:]

Assignee

Comment 20

•

6 years ago

https://treeherder.mozilla.org/#/jobs?repo=nss-try&revision=9f9a4a531b8e753601b89101b1d1a0ade88e4807

Kai Engert [:KaiE:]

Assignee

Updated

•

6 years ago

Summary: NSPR tests shouldn't use /tmp; exclude failing multiwait; Make target runtests should call runtests.sh; remove FD stack testing → Multiple changes to NSPR tests, find the mimimum set of tests that currently work on all platforms, prepare to reenable CI in 2019.

Jan Beich

Comment 21

•

6 years ago

Comment on attachment 9105989 [details] [diff] [review] 1579448-v5.patch [checked in] Works fine on FreeBSD: tested 11.3 i386 and 13.0 amd64.

Attachment #9105989 - Flags: review?(jbeich) → review+

Jan Beich

Comment 22

•

6 years ago

•

Edited

Comment on attachment 9105989 [details] [diff] [review] 1579448-v5.patch [checked in] Review of attachment 9105989 [details] [diff] [review]: ----------------------------------------------------------------- ::: pr/tests/runtests.sh @@ +268,5 @@ > done > fi; > > +if [ $rval -ne 0 ]; then > + cat ${LOGFILE} Maybe bad UX outside of automation. When a test fails its name can be determined from the summary, so user can rerun the test manually and analyze its verbose output. However, verbose output doesn't annotate test failures, so printing large $LOGFILE may consume entire scroll buffer, leaving user guessing which test actually failed. ${PAGER:-cat} may help.

Kai Engert [:KaiE:]

Assignee

Comment 23

•

6 years ago

(In reply to Jan Beich from comment #22)

Maybe bad UX outside of automation. When a test fails its name can be
determined from the summary, so user can rerun the test manually and analyze
its verbose output. However, verbose output doesn't annotate test failures,
so printing large $LOGFILE may consume entire scroll buffer, leaving user
guessing which test actually failed.

${PAGER:-cat} may help.

Good point.

Alternatively, maybe an environment variable to disable the cat?

Kai Engert [:KaiE:]

Assignee

Comment 24

•

6 years ago

Attached patch 1579448-hide.patch — Details — Splinter Review

Attachment #9106110 - Flags: review?(jbeich)

Kai Engert [:KaiE:]

Assignee

Comment 25

•

6 years ago

Comment on attachment 9105989 [details] [diff] [review] 1579448-v5.patch [checked in] https://hg.mozilla.org/projects/nspr/rev/d03855113364d4a9f182a9f820b1787fd1100364

Attachment #9105989 - Attachment description: 1579448-v5.patch → 1579448-v5.patch [checked in]

Kai Engert [:KaiE:]

Assignee

Comment 26

•

6 years ago

Jan, do you want the incremental patch?

Target Milestone: --- → 4.24

Kai Engert [:KaiE:]

Assignee

Comment 27

•

6 years ago

Had to disable ntioto test because it fails.
https://hg.mozilla.org/projects/nspr/rev/50c1448672f16e98895a256575bdc0a382d1c0a5

Note the failure occurred with a revision that DIDN'T have the change from bug 1586070 yet.

Kai Engert [:KaiE:]

Assignee

Updated

•

6 years ago

Target Milestone: 4.24 → 4.25

Kai Engert [:KaiE:]

Assignee

Comment 28

•

6 years ago

Fixed, minus the pending change, which was optional and didn't get a review yet.

Status: NEW → RESOLVED

Closed: 6 years ago

Resolution: --- → FIXED

Mike Hommey [:glandium]

Comment 29

•

6 years ago

FWIW, multiwait passes on Debian.

You need to log in before you can comment on or make changes to this bug.