Skip to content

WeeklyTelcon_20170815

Geoffrey Paulsen edited this page Jan 9, 2018 · 1 revision

Open MPI Weekly Telcon


  • Dialup Info: (Do not post to public mailing list or public wiki)

Attendees

  • Josh Hursey (IBM)
  • Jeff Squyres (Cisco)
  • Geoff Paulsen (IBM)
  • Artem (Mellanox)
  • Brian Barrett
  • Geoffroy Vallee (ORNL)
  • Howard
  • Ralph Castain (Intel)
  • Nathan Hjelm (LANL)
  • Mohan

Agenda

Review v2.0.x Milestones v2.0.4

  • Nothing new to report.

Review v2.x Milestones v2.1.2

  • 212
    • Status of 4071 - PMIX visibility.
    • Released in PMIx 1.2.3.
    • Take this pending a review.
  • PR4059 - NEWs
    • Some discussion about -xrc issue. Is this a regression from v2.x or existed always.
    • Artem will test, and add info to the NEWs.
  • PR4080 - More than just VERSION change. Some cleanup, but also removes some hand-coded assembly.
  • Will roll a new v2.1.x RC after tonight's tests look good.

Review v3.0.x Milestones v3.0

  • PR 3980 - If you set OPAL_PREFIX, PMIX has it's own plugin directories. Moving OPAL_PREFIX
    • Fixed in 4052. - Will go into RC3.
  • XLC related tickets
  • Issue3992(https://github.com/open-mpi/ompi/issues/3984) - BSD div-by-0 in hwloc - More of an issue with a particular version of Free BSD issue, just document.
    • For this particular version, just use GCC, Clang introduces the issue.
  • Hostfile behavior change (https://github.com/open-mpi/ompi/issues/3984)
  • PR - libpmix with devel headers, a linking thing that needs to be fixed PR4046
  • PR - 4022 - hwloc upgrade to v1.11.7.
  • After PRs go in. Will Generate an RC 3 tomorrow after we get all of these in.
    • Do an RC4 when Ralph gets PMIx changes in.
  • Some discussion in mail that Open MPI not compatible with libevent 2.1.8,
    • It's REALLY just a timing issue that hits an IO forwarding issue. - NOT backported.
    • Add an item to the NEWS that we (v3.0) ARE now compatible with libevent 2.1.8

Review Master Master Pull Requests

  • Does appear we have a number of PMI install failures on master.
    • Perhaps could have been fixed, since it's dated on the 12th. Cisco / Absoft.
      • 'nm_check_prefix' - this test requires an ENV to run. There is a directive in Makefile.am to run.
        • This test is used to report exported symbols that shouldn't be exported.
        • For some environments this check isn't working. Will get Mark to look at again.
        • Mark (IBM) will look at.
    • gds_dstore in PMIx - compile error. Undefined reference to opal_atomic.
  • Something is missing in hwloc tarball - missing netloc
  • libpmix failed in linking against libopal.
    • Ralph will look at these.

MTT / Jenkins Testing

MTT Dev status:

Jenkins CI


Exceptional topics

  • AWS - 1 year renewal coming up.

    • Thank you Amazon.
    • Brian will take car of renewal.
  • Next face-to-face meeting

    • Jan / Feb
    • Dallas, San Jose, Portland, Albuquerque

Status Updates:

Status Update Rotation

  1. Mellanox, Sandia, Intel
  2. LANL, Houston, IBM, Fujitsu
  3. Amazon,
  4. Cisco, ORNL, UTK, NVIDIA

Back to 2017 WeeklyTelcon-2017

Clone this wiki locally