Skip to content
Snippets Groups Projects
  1. Jul 22, 2016
    • Peter Maydell's avatar
    • Peter Maydell's avatar
      target-sh4: Use glib allocator in movcal helper · 01a72012
      Peter Maydell authored
      
      Coverity spots that helper_movcal() calls malloc() but doesn't
      check for failure. Fix this by switching to the glib allocation
      functions, which abort on allocation failure.
      
      Signed-off-by: default avatarPeter Maydell <peter.maydell@linaro.org>
      Message-id: 1468327859-21385-1-git-send-email-peter.maydell@linaro.org
      Acked-by: default avatarAurelien Jarno <aurelien@aurel32.net>
      01a72012
    • Peter Maydell's avatar
      Merge remote-tracking branch 'remotes/amit-migration/tags/migration-for-2.7-6' into staging · e3643d32
      Peter Maydell authored
      
      Migration:
      - Fix a postcopy bug
      - Add a testsuite for measuring migration performance
      
      # gpg: Signature made Fri 22 Jul 2016 08:56:44 BST
      # gpg:                using RSA key 0xEB0B4DFC657EF670
      # gpg: Good signature from "Amit Shah <amit@amitshah.net>"
      # gpg:                 aka "Amit Shah <amit@kernel.org>"
      # gpg:                 aka "Amit Shah <amitshah@gmx.net>"
      # Primary key fingerprint: 48CA 3722 5FE7 F4A8 B337  2735 1E9A 3B5F 8540 83B6
      #      Subkey fingerprint: CC63 D332 AB8F 4617 4529  6534 EB0B 4DFC 657E F670
      
      * remotes/amit-migration/tags/migration-for-2.7-6:
        tests: introduce a framework for testing migration performance
        scripts: ensure monitor socket has SO_REUSEADDR set
        scripts: set timeout when waiting for qemu monitor connection
        scripts: refactor the VM class in iotests for reuse
        scripts: add a 'debug' parameter to QEMUMonitorProtocol
        scripts: add __init__.py file to scripts/qmp/
        migration: set state to post-migrate on failure
      
      Signed-off-by: default avatarPeter Maydell <peter.maydell@linaro.org>
      e3643d32
    • Daniel P. Berrangé's avatar
      tests: introduce a framework for testing migration performance · 409437e1
      Daniel P. Berrangé authored
      
      This introduces a moderately general purpose framework for
      testing performance of migration.
      
      The initial guest workload is provided by the included 'stress'
      program, which is configured to spawn one thread per guest CPU
      and run a maximally memory intensive workload. It will loop
      over GB of memory, xor'ing each byte with data from a 4k array
      of random bytes. This ensures heavy read and write load across
      all of guest memory to stress the migration performance. While
      running the 'stress' program will record how long it takes to
      xor each GB of memory and print this data for later reporting.
      
      The test engine will spawn a pair of QEMU processes, either on
      the same host, or with the target on a remote host via ssh,
      using the host kernel and a custom initrd built with 'stress'
      as the /init binary. Kernel command line args are set to ensure
      a fast kernel boot time (< 1 second) between launching QEMU and
      the stress program starting execution.
      
      None the less, the test engine will initially wait N seconds for
      the guest workload to stablize, before starting the migration
      operation. When migration is running, the engine will use pause,
      post-copy, autoconverge, xbzrle compression and multithread
      compression features, as well as downtime & bandwidth tuning
      to encourage completion. If migration completes, the test engine
      will wait N seconds again for the guest workooad to stablize on
      the target host. If migration does not complete after a preset
      number of iterations, it will be aborted.
      
      While the QEMU process is running on the source host, the test
      engine will sample the host CPU usage of QEMU as a whole, and
      each vCPU thread. While migration is running, it will record
      all the stats reported by 'query-migration'. Finally, it will
      capture the output of the stress program running in the guest.
      
      All the data produced from a single test execution is recorded
      in a structured JSON file. A separate program is then able to
      create interactive charts using the "plotly" python + javascript
      libraries, showing the characteristics of the migration.
      
      The data output provides visualization of the effect on guest
      vCPU workloads from the migration process, the corresponding
      vCPU utilization on the host, and the overall CPU hit from
      QEMU on the host. This is correlated from statistics from the
      migration process, such as downtime, vCPU throttling and iteration
      number.
      
      While the tests can be run individually with arbitrary parameters,
      there is also a facility for producing batch reports for a number
      of pre-defined scenarios / comparisons, in order to be able to
      get standardized results across different hardware configurations
      (eg TCP vs RDMA, or comparing different VCPU counts / memory
      sizes, etc).
      
      To use this, first you must build the initrd image
      
       $ make tests/migration/initrd-stress.img
      
      To run a a one-shot test with all default parameters
      
       $ ./tests/migration/guestperf.py > result.json
      
      This has many command line args for varying its behaviour.
      For example, to increase the RAM size and CPU count and
      bind it to specific host NUMA nodes
      
       $ ./tests/migration/guestperf.py \
             --mem 4 --cpus 2 \
             --src-mem-bind 0 --src-cpu-bind 0,1 \
             --dst-mem-bind 1 --dst-cpu-bind 2,3 \
             > result.json
      
      Using mem + cpu binding is strongly recommended on NUMA
      machines, otherwise the guest performance results will
      vary wildly between runs of the test due to lucky/unlucky
      NUMA placement, making sensible data analysis impossible.
      
      To make it run across separate hosts:
      
       $ ./tests/migration/guestperf.py \
             --dst-host somehostname > result.json
      
      To request that post-copy is enabled, with switchover
      after 5 iterations
      
       $ ./tests/migration/guestperf.py \
             --post-copy --post-copy-iters 5 > result.json
      
      Once a result.json file is created, a graph of the data
      can be generated, showing guest workload performance per
      thread and the migration iteration points:
      
       $ ./tests/migration/guestperf-plot.py --output result.html \
              --migration-iters --split-guest-cpu result.json
      
      To further include host vCPU utilization and overall QEMU
      utilization
      
       $ ./tests/migration/guestperf-plot.py --output result.html \
              --migration-iters --split-guest-cpu \
      	--qemu-cpu --vcpu-cpu result.json
      
      NB, the 'guestperf-plot.py' command requires that you have
      the plotly python library installed. eg you must do
      
       $ pip install --user  plotly
      
      Viewing the result.html file requires that you have the
      plotly.min.js file in the same directory as the HTML
      output. This js file is installed as part of the plotly
      python library, so can be found in
      
        $HOME/.local/lib/python2.7/site-packages/plotly/offline/plotly.min.js
      
      The guestperf-plot.py program can accept multiple json files
      to plot, enabling results from different configurations to
      be compared.
      
      Finally, to run the entire standardized set of comparisons
      
        $ ./tests/migration/guestperf-batch.py \
             --dst-host somehost \
             --mem 4 --cpus 2 \
             --src-mem-bind 0 --src-cpu-bind 0,1 \
             --dst-mem-bind 1 --dst-cpu-bind 2,3
             --output tcp-somehost-4gb-2cpu
      
      will store JSON files from all scenarios in the directory
      named tcp-somehost-4gb-2cpu
      
      Signed-off-by: default avatarDaniel P. Berrange <berrange@redhat.com>
      Message-Id: <1469020993-29426-7-git-send-email-berrange@redhat.com>
      Signed-off-by: default avatarAmit Shah <amit.shah@redhat.com>
      409437e1
    • Daniel P. Berrangé's avatar
      scripts: ensure monitor socket has SO_REUSEADDR set · 168ae6c2
      Daniel P. Berrangé authored
      
      If tests use a TCP based monitor socket, the connection will
      go into a TIMED_WAIT state when the test exits. This will
      randomly prevent the test from being re-run without a certain
      time period. Set the SO_REUSEADDR flag on the socket to ensure
      we can immediately re-run the tests
      
      Signed-off-by: default avatarDaniel P. Berrange <berrange@redhat.com>
      Message-Id: <1469020993-29426-6-git-send-email-berrange@redhat.com>
      Signed-off-by: default avatarAmit Shah <amit.shah@redhat.com>
      168ae6c2
    • Daniel P. Berrangé's avatar
      scripts: set timeout when waiting for qemu monitor connection · 23806462
      Daniel P. Berrangé authored
      
      If QEMU fails to launch for some reason, the QEMUMonitorProtocol
      class accept() method will wait forever in a socket accept call.
      Set a timeout of 15 seconds so that we fail more gracefully
      instead of hanging the test script forever
      
      Signed-off-by: default avatarDaniel P. Berrange <berrange@redhat.com>
      Message-Id: <1469020993-29426-5-git-send-email-berrange@redhat.com>
      Signed-off-by: default avatarAmit Shah <amit.shah@redhat.com>
      23806462
    • Daniel P. Berrangé's avatar
      scripts: refactor the VM class in iotests for reuse · 66613974
      Daniel P. Berrangé authored
      
      The iotests module has a python class for controlling QEMU
      processes. Pull the generic functionality out of this file
      and create a scripts/qemu.py module containing a QEMUMachine
      class. Put the QTest integration support into a subclass
      QEMUQtestMachine.
      
      Signed-off-by: default avatarDaniel P. Berrange <berrange@redhat.com>
      Message-Id: <1469020993-29426-4-git-send-email-berrange@redhat.com>
      Signed-off-by: default avatarAmit Shah <amit.shah@redhat.com>
      66613974
    • Daniel P. Berrangé's avatar
      scripts: add a 'debug' parameter to QEMUMonitorProtocol · 991e7c46
      Daniel P. Berrangé authored
      
      Add a 'debug' parameter to the QEMUMonitorProtocol class
      which will cause it to print out all JSON strings on
      sys.stderr
      
      Signed-off-by: default avatarDaniel P. Berrange <berrange@redhat.com>
      Message-Id: <1469020993-29426-3-git-send-email-berrange@redhat.com>
      Signed-off-by: default avatarAmit Shah <amit.shah@redhat.com>
      991e7c46
    • Daniel P. Berrangé's avatar
      scripts: add __init__.py file to scripts/qmp/ · 6f7a4a81
      Daniel P. Berrangé authored
      
      When searching for modules to load, python will ignore any
      sub-directory which does not contain __init__.py. This means
      that both scripts and scripts/qmp/ have to be explicitly added
      to the python path. By adding a __init__.py file to scripts/qmp,
      we only need add scripts/ to the python path and can then simply
      do 'from qmp import qmp' to load scripts/qmp/qmp.py.
      
      Signed-off-by: default avatarDaniel P. Berrange <berrange@redhat.com>
      Message-Id: <1469020993-29426-2-git-send-email-berrange@redhat.com>
      Signed-off-by: default avatarAmit Shah <amit.shah@redhat.com>
      6f7a4a81
    • Dr. David Alan Gilbert's avatar
      migration: set state to post-migrate on failure · 42da5550
      Dr. David Alan Gilbert authored
      If a migration fails/is cancelled during the postcopy stage we currently
      end up with the runstate as finish-migrate, where it should be post-migrate.
      There's a small window in precopy where I think the same thing can
      happen, but I've never seen it.
      
      It rarely matters; the only postcopy case is if you restart a migration, which
      again is a case that rarely matters in postcopy because it's only
      safe to restart the migration if you know the destination hasn't
      been running (which you might if you started the destination with -S
      and hadn't got around to 'c' ing it before the postcopy failed).
      Even then it's a small window but potentially you could hit if
      there's a problem loading the devices on the destination.
      
      This corresponds to:
      https://bugzilla.redhat.com/show_bug.cgi?id=1355683
      
      
      
      Signed-off-by: default avatarDr. David Alan Gilbert <dgilbert@redhat.com>
      Reviewed-by: default avatarAmit Shah <amit.shah@redhat.com>
      Message-Id: <1468601086-32117-1-git-send-email-dgilbert@redhat.com>
      Signed-off-by: default avatarAmit Shah <amit.shah@redhat.com>
      42da5550
  2. Jul 21, 2016
Loading