figure out why "FreeStorm Win7-amd64-mingw py2.6" is red on the "test-with-fake-pkg" step #1246
Labels
No Label
0.2.0
0.3.0
0.4.0
0.5.0
0.5.1
0.6.0
0.6.1
0.7.0
0.8.0
0.9.0
1.0.0
1.1.0
1.10.0
1.10.1
1.10.2
1.10a2
1.11.0
1.12.0
1.12.1
1.13.0
1.14.0
1.15.0
1.15.1
1.2.0
1.3.0
1.4.1
1.5.0
1.6.0
1.6.1
1.7.0
1.7.1
1.7β
1.8.0
1.8.1
1.8.2
1.8.3
1.8β
1.9.0
1.9.0-s3branch
1.9.0a1
1.9.0a2
1.9.0b1
1.9.1
1.9.2
1.9.2a1
LeastAuthority.com automation
blocker
cannot reproduce
cloud-branch
code
code-dirnodes
code-encoding
code-frontend
code-frontend-cli
code-frontend-ftp-sftp
code-frontend-magic-folder
code-frontend-web
code-mutable
code-network
code-nodeadmin
code-peerselection
code-storage
contrib
critical
defect
dev-infrastructure
documentation
duplicate
enhancement
fixed
invalid
major
minor
n/a
normal
operational
packaging
somebody else's problem
supercritical
task
trivial
unknown
was already fixed
website
wontfix
worksforme
No Milestone
No Assignees
2 Participants
Notifications
Due Date
No due date set.
Reference: tahoe-lafs/trac-2024-07-25#1246
Loading…
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
(Note that the two failures here have different causes. From comment:4 onwards, this ticket is about the second one.)
MM's buildslave "MM netbsd5 i386 warp" fails the test-from-egg step, like this:
FS's buildslave "FreeStorm Win7-amd64-mingw py2.6" fails the test-with-fake-pkg step, like this:
(In a subsequent build, this buildslave hung, and then in subsequent builds it was unable to start due to a process holding a handle to the
build
directory.)For the warp client, there isn't an allmydata egg except inside the egginstalldir itself; therefore the issue seems to be:
Code highlighting:
**Right in here... **
If I change the above os.chdir('.') to os.chdir('egginstalldir') then the test proceeds as normal. Since this is ostensibly a test from the egginstalldir location, I fail to see why it should be globbing the eggs from the enclosing directory prior to changing directory into the egginstalldir and then running the trial from that directory after overriding PYTHONPATH.
So I guess I am failing to pierce the meaning behind the way we are prepending the eggs' realpath() to PYTHONPATH from pre- to post- chdir().
Attachment buildmaster-search-parent-for-eggs.darcs.patch (81043 bytes) added
buildmaster bbsupport.py: have the test-from-egg step look for eggs in the egginstalldir as well as its parent. refs #1246
zooko: please apply buildmaster-search-parent-for-eggs.darcs.patch and restart the buildmaster.
Replying to davidsarah:
Done:
http://tahoe-lafs.org/buildbot/builders/MM%20netbsd5%20i386%20warp/builds/135
It looks like MM's buildslave is all fixed up now. FreeStorm's Win67-amd64-mingw-py2.6 is not, though:
http://tahoe-lafs.org/buildbot/builders/FreeStorm%20Win7-amd64-mingw%20py2.6/builds/154/steps/test-with-fake-pkg/logs/stdio
figure out why MM's and FS's buildslaves are red on some stepsto figure out why "FreeStorm Win7-amd64-mingw py2.6" is red on the "test-with-fake-pkg" stepReplying to [zooko]comment:3:
The
Requirements: ['allmydata-tahoe==1.8.0-r4803', ...]
line indicates what source:misc/build_helpers/run_trial.py set the*requires*_
variable to. That is supposed to tellpkg_resources
to insert entries onsys.path
so that only modules from exactly that version of Tahoe-LAFS will be imported. But in fact the installed version (which cannot be 1.8.0-r4803 because that number increases with each build) is being run.I asked freestorm to modify
run_trial.py
to print outsys.path
just before and just after importingpkg_resources
. The results show no specific version of theallmydata-tahoe
package on either of these paths. In particular, thesrc
directory of the Tahoe-LAFS distribution is not on the "before" path, as I would have expected. (That is the current directory when executingrun_trial.py
from source:misc/build_helpers/test-with-fake-pkg.py.)The code in
pkg_resources
that inserts path entries is difficult to follow, but I think that it only reorders entries and adds .egg files from directories that are already on thesys.path
, which would explain why it's not able to find the correct version ofallymydata-tahoe
. Perhaps it would help to explicitly add thesrc
directory to thePYTHONPATH
variable intest-with-fake-pkg.py
; I'll ask freestorm to try that.Attachment freestorm-test-with-fake-pkg-output.txt (6336 bytes) added
output of test-with-fake-pkg.py on freestorm's builder, with printing of sys.path before and after 'import pkg_resources' in run_trial.py
Replying to [davidsarah]comment:4:
Sorry, I was mistaken. It is on line 24 "before" and line 68 "after". The reason why the installed Tahoe is being picked up instead is that
c:\python26\lib\site-packages
is earlier thanc:\buildbot_tahoe_0.8.2\freestorm_win7-amd64-mingw_py2.6\build\src
on the "after" path, even though it is later on the "before" path. So importingpkg_resources
incorrectly changed the order of these two entries.Replying to [davidsarah]comment:5:
Freestorm and I had a long debugging session and worked out what was causing this. The buildslave has three distributions that are installed directly in
C:\Python26\lib\site-packages
:allmydata_tahoe-1.8.0c1_r4687-py2.6
,pywin32-214-py2.6.egg-info
andTwisted-10.1.0-py2.6
. Each is installed as an.egg-info
directory and source files with the same root (e.g.Twisted-10.1.0-py2.6
andtwisted
both insite-packages
).That meant that when
pkg_resources
tried to add an entry for Twisted (10.1.0) tosys.path
, it added the locationC:\Python26\lib\site-packages
, which effectively also added Tahoe 1.8.0c1-r4687 tosys.path
with the same priority. The correct Tahoe 1.8.0-r4803 was also later on the path, butpkg_resources
makes no attempt to order the requirements specified in*requires*
relative to each other..egg-info
installations directly intosite-packages
are apparently quite common :-( On my machine I have Tahoe (several .egg-infos for different versions), pywin32, Twisted 9.0.0, pycairo, pyflakes, and zetuptoolz installed in that way. The pywin32 and Twisted.exe
installers apparently do this by default.According to [http://stackoverflow.com/questions/256417/python-packages-and-egg-info-directories this stackoverflow answer], ".egg-info directories get only created if
--single-version-externally-managed
was used to install the egg". My conclusion from this is that we should never use--single-version-externally-managed
to install Tahoe into any directory where other distributions might also be installed.Anyway, Freestorm deleted the
allmydata
andallmydata_tahoe-1.8.0c1_r4687-py2.6.egg-info
directories fromsite-packages
(after taking a copy), and that [http://tahoe-lafs.org/buildbot/builders/FreeStorm%20Win7-amd64-mingw%20py2.6/builds/157/steps/test-with-fake-pkg/logs/stdio made test-with-fake-pkg go green].So if I understand correctly, the problem you are focussing on is that sometimes distributions get installed by copying their modules directly into some directory which is on
sys.path
. This makes it so thatimport $MODULENAME
will subsequently work but it is a problem, if I understand you correctly, if more than one distribution is installed into the same directory because then there is no way to prioritize one of the distributions without thereby also prioritizing all the other distributions in the same directory. By "prioritize" here I guess I mean move it closer to the front ofsys.path
. Other methods of manipulating its precedence before importing would be moving it closer to the end of thesys.path
or removing the directory from thesys.path
entirely. None of these can be done on only a single distribution if more than one distribution has been installed "directly" (by dint of copying its modules into place) in a single directory.One thing to note is that the standard distutils, without setuptools/zetuptoolz/distribute, always installs things in this way! If you're using the standard distutils, the only way to avoid doing this would be to create a new empty directory and install into that (with
--prefix
) and then add that to thesys.path
. You would have to do that for every distribution that you install.Okay, so now I can appreciate David-Sarah's conclusion: "We should never use
--single-version-externally-managed
to install Tahoe into any directory where other distributions might also be installed.". That makes sense. We already do avoid that, as far as I know. For example, quickstart.html doesn't advise the user to do that.Replying to [zooko]comment:8:
Exactly.
However, the problem in this ticket only applies to Tahoe-LAFS dependencies (direct or indirect, including the
allmydata-tahoe
package itself) that are installed in this way.This could be worked around in several ways:
a) change zetuptoolz to order "development" dists (directories containing Python code and an
.egg-info
subdirectory) after non-development dists when satisfying*requires*
lists. Note: thepkg_resources
code involved is complicated and I don't understand it.b) change zetuptoolz so that it does not view development dists as satisfying a dependency. That would cause a non-development dist to be used automatically instead. (pywin32 is a potential spanner in the works because it can't currently be installed as an egg, but maybe #142 could be fixed.) Fixing #1233 is important here, because otherwise you would have regressions where a package is already installed but only as a development dist, and building a new version of it fails because it can't be compiled.
c) change the wrapper scripts not to use the
*requires*
mechanism and to put the dependencies directly at the start ofsys.path
itself. This would still have to avoid viewing development dists as satisfying a dependency, or else it would run into the same problem.[...]
As far as I can see, nothing in our docs advises against installing Tahoe using
setup.py install
oreasy_install
. That's a standard way of installing Python packages.Besides, there's no good reason why that shouldn't work (that is, make the installed version the default, but still allow running other versions). Design failings in distutils and/or setuptools are a bad reason.
Replying to [davidsarah]comment:9:
Hmm, one way to make this work is for
setup.py install
(and thereforeeasy_install
) to actually do a build rather than an install, but also install atahoe
wrapper script for that build to the global scripts directory.Replying to [davidsarah]comment:10:
Maybe this is as simple as putting "
install = darcsver --count-all-patches develop --prefix=support make_executable build install_scripts
" in place of the existinginstall
alias in Tahoe's source:setup.cfg.Replying to [davidsarah]comment:11:
Apparently not. That does cause
python setup.py install
to build Tahoe successfully and generateC:\Python26\Scripts\tahoe.pyscript
, and the resulting script has a correct*requires*
line. Nevertheless it fails (on this machine) with the errorThis is understandable because
pycryptopp>=0.5.20
is not globally installed, the support dir of the Tahoe build is not on thePYTHONPATH
, andC:\Python26\Scripts\tahoe.pyscript
does not put it onto thePYTHONPATH
(since it is a zetuptoolz-generated script rather than the one from Tahoe'sbin
directory). Also note that it is usingpkg_resources
fromsetuptools-0.6c16dev
, which is the globally installed version.Getting
setup.py install
to work as I want it to is possible, but might have to wait for Brian's "unsuck" branch, since it's probably not a good idea to make the zetuptoolz-generated script as complicated as [the 'bin/tahoe' one]source:bin/tahoe-script.template.Attachment install-should-only-install-scripts.darcs.patch (19728 bytes) added
setup.py, setup.cfg, bin/tahoe-script.template: make it so that 'python setup.py install' just does a build and installs a script pointing to that build. refs #1246
Replying to [davidsarah]comment:10:
install-should-only-install-scripts.darcs.patch implements that. If we agree that it's a good idea, we would need a buildbot step to test it.
Replying to [davidsarah]comment:13:
... and reconsider how to implement install-to-prefix, which this patch breaks.
Replying to [davidsarah]comment:6:
Unfortunately, that's probably not sufficient (although it is necessary). The bug requires that at least one Tahoe dependency be installed as an
.egg-info
, but the other distribution in the same directory that gets imported by mistake does not itself have to have an.egg-info
directory.So, we should never use
--single-version-externally-managed
or equivalent to install Tahoe or any of its dependencies into any directory where other distributions might also be installed. But we can't control how users install, say, Twisted or pywin32 (which have installers that are equivalent to using--single-version-externally-managed
).Attachment misc-build_helper-test-1246.darcs.patch (12919 bytes) added
misc-build_helper-test-1246.darcs.patch is my attempt to exercise this problem. However, I don't think this attempt succeeds. When I run
python misc/build_helpers/test-dists-in-shared-directory.py
, I get this result:Maybe this means that the current
zetuptoolz
-based build system passes this test? Maybe the test that we want has two versions of the package, both of which are of a sufficiently high version number, and we are trying to show that the build system mistakenly imports one whenpkg_resources
reports that it uses the other.Note that this test works only if you don't have a sufficient version of
pycryptopp
installed in your system. If there is already a sufficient version of pycryptopp, then you'll get a skip.Whoops, in fact you get some sort of failure-to-run trial:
In at least some cases (#1258),
setup.py
finds the correct code butbin/tahoe
does not. Perhaps that difference is involved in the misc-build_helper-test-1246.darcs.patch test's failure to reproduce the problem (since the test invokesrun_trial.py
which invokessetup.py
).Replying to davidsarah:
Here is a untried test that imports pycryptopp in a subprocess, similar to
test_import_from_repl
in attachment🎫1258:add-test-import-in-repl.darcs.patch. Unfortunately it is complicated and depends on more than we want to depend on for this test, but let's see whether it helps to reproduce the problem.[to fix an import error.]Edited
The test in comment:18 does reproduce the error for me:
It still needs work, though. If I delete the temp directory and the code that creates the fake pycryptopp 0.5.13, I get a spurious failure:
That is, the repl indirectly attempts to import
pycryptopp.cipher.aes
. Working around that by removing the dependency on AES fromfileutil.py
, I get a skip:(0.5.25-!r757 is the version of pycryptopp that normally gets used when running Tahoe on this system. It is not installed in site-packages.)
The reason why this happens is that the
bin/tahoe
script adds thesupport/{Lib,lib/pythonx.x}/site-packages
directory to PYTHONPATH before the entries that were inherited fromtest-dists-in-shared-directory.py
. Therefore, it finds the real pycryptopp before the fake pycryptopp 99.0.0. I confirmed that by modifying the test to print PYTHONPATH in the repl, which gives (one entry on each line for clarity):(Ignore the foolscap entry, that's a workaround for foolscap 0.6.0 not being found on my system.)
My conclusion at this point is that we need to know why the problem is only occurring in subprocesses, before we can write a reliable test.
Removing
review-needed
. I will try to write a test.In changeset:e52d97a917364cb0:
Replying to davidsarah:
This is due to a different order for entries in
*requires*
. With help from abadger1999 (Toshio Kuratomi), we worked out a reliable test, using only three dists (two versions of one package and one of another), that will be independent of any globally installed packages. I'll prepare a patch for that.I'm now sure that this is the same issue as #1258, so I'm marking it as a duplicate (but all of the discussion in this ticket from comment:4 on is still relevant).