MIT License
ParaMonte: plain powerful parallel Monte Carlo library.
Copyright (C) 2012-present, The Computational Data Science Lab
https://github.com/cdslaborg/paramonte

Making 2D and 3D visualizations of the evolution of the proposal distributions of the Adaptive MCMC samplers of the ParaMonte library

NOTE
If you are viewing an HTML version of this MATLAB live script on the web, you can download the corresponding MATLAB live script visualization_proposal_ellipsoid.mlx file to this HTML page at,
https://github.com/cdslaborg/paramontex/blob/fbeca6745684c798ff28c1bf57cfae0c190db478/MATLAB/mlx
Once you download the file, open it in MATLAB to view and interact with its contents, which is the same as what you see on this page.
First, let's clean up the MATLAB environment and make sure the path to the ParaMonte library is in MATLAB's path list.
clc;
clear all;
close all;
format compact; format long;
%%%%%%%%%%%% IMPORTANT %%%%%%%%%%%%%
% Set the path to the ParaMonte library:
% Change the following path to the ParaMonte library root directory,
% otherwise, make sure the path to the ParaMonte library is already added
% to MATLAB's path list.
pmlibRootDir = './';
addpath(genpath(pmlibRootDir),"-begin");
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
% change MATLAB's working directory to the folder containing this script
% if MATLAB Live Scripts did not create a temporary folder, we would not
% have all of these problems!
try
setwdFilePath = websave("setwd.m","https://github.com/cdslaborg/paramontex/blob/fbeca6745684c798ff28c1bf57cfae0c190db478/MATLAB/mlx/setwd.m");
run(setwdFilePath); % This is a MATLAB script that you can download from the same GitHub location given in the above.
catch % alas, we will have to run the simulations in MATLAB Live Script's temporary folder
filePath = mfilename('fullpath');
[currentDir,fileName,fileExt] = fileparts(filePath); cd(currentDir);
cd(fileparts(mfilename('fullpath'))); % Change working directory to source code directory.
end

ParaMonte's default visualization tools

The ParaMonte library ships with several visualization tools that automate much of the MATLAB coding required to visualize the output of the simulations performed by the ParMonte library samplers.
By replacing the input dataFrame to these tools and following the conventions of the ParaMonte library, one can also use these visualization tools for any dataset that may not have been generated by the ParaMonte library.
Consider the following restart file on the web which contains the restart information in ASCII format as indicated by the extension of the file (.txt). This file is generated by the ParaDRAM sampler for sampling a 4-dimensional Multivariate Normal distribution. Since this is a restart file as indicated by its suffix "_restart.txt" We will read this file via the ParaDRAM sampler's readRestart() method.
In addition to the restart information, the ASCII-formatted restart files that are generated by the Adaptive MCMC samplers of the ParaMonte library (such as the ParaDRAM and ParaDISE samplers) also contain information about the evolution of the adaptations of the proposal distributions of the MCMC samplers (if any adaptation occurs during the simulation). Monitoring this adaptation and ensuring that it diminishes toward the end of the simulation is not only essential for the successful unbiased sampling of the target density function but can also be educational.
For this reason, the adaptive MCMC samplers of the ParaMonte library provide a set of tools that can automatically visualize the evolution of the covariance or correlation matrices of the proposal distributions of the samplers. These tools are encapsulated in a class named EllipsoidPlot(). By default, when an ASCII-formated restart file is parsed, via for example, the readRestart() method of the ParaDRAM sampler, the parser routine automatically adds several plotting tools to the output object of the parser which contains the parsed restart data as well.
pm = paramonte();
pmpd = pm.ParaDRAM();
url = "https://github.com/cdslaborg/paramontex/blob/fbeca6745684c798ff28c1bf57cfae0c190db478/MATLAB/mlx/sampling_multivariate_normal_distribution_via_paradram/out/mvn_serial_process_1_restart.txt";
pmpd.readRestart(url); % read the restart file from the web
ParaDRAM - NOTE: 1 files detected matching the pattern: ParaDRAM - NOTE: "https://github.com/cdslaborg/paramontex/blob/fbeca6745684c798ff28c1bf57cfae0c190db478/MATLAB/mlx/sampling_multivariate_normal_distribution_via_paradram/out/mvn_serial_process_1_restart.txt" ParaDRAM - NOTE: processing file: D:\Dropbox\Projects\20180101_ParaMonte\paramontex\MATLAB\mlx\visualization\temp_20201004_041428_125.txt ParaDRAM - NOTE: reading the file contents... |100% ParaDRAM - NOTE: done in 10.455000 seconds. ParaDRAM - NOTE: creating the line plot object from scratch... ParaDRAM - NOTE: done in 0.273490 seconds. ParaDRAM - NOTE: creating the scatter plot object from scratch... ParaDRAM - NOTE: done in 0.066011 seconds. ParaDRAM - NOTE: creating the lineScatter plot object from scratch... ParaDRAM - NOTE: done in 0.067516 seconds. ParaDRAM - NOTE: creating the covmat2 plot object from scratch... ParaDRAM - NOTE: done in 0.116550 seconds. ParaDRAM - NOTE: creating the covmat3 plot object from scratch... ParaDRAM - NOTE: done in 0.055672 seconds. ParaDRAM - NOTE: creating the cormat2 plot object from scratch... ParaDRAM - NOTE: done in 0.107170 seconds. ParaDRAM - NOTE: creating the cormat3 plot object from scratch... ParaDRAM - NOTE: done in 0.015100 seconds. ParaDRAM - NOTE: The processed restart files are now stored in the newly-created component "pmpd.restartList" of the ParaDRAM - NOTE: ParaDRAM object as a cell array. For example, to access the contents of the first (or the only) ParaDRAM - NOTE: restart file, try: ParaDRAM - NOTE: ParaDRAM - NOTE: pmpd.restartList{1}.df ParaDRAM - NOTE: ParaDRAM - NOTE: To access the plotting tools, try: ParaDRAM - NOTE: ParaDRAM - NOTE: pmpd.restartList{1}.plot.<PRESS TAB TO SEE THE LIST OF PLOTS> ParaDRAM - NOTE: ParaDRAM - NOTE: For example, ParaDRAM - NOTE: ParaDRAM - NOTE: pmpd.restartList{1}.plot.line.make() % to make bivariate line plots. ParaDRAM - NOTE: pmpd.restartList{1}.plot.scatter.make() % to make bivariate scatter plots. ParaDRAM - NOTE: pmpd.restartList{1}.plot.lineScatter.make() % to make bivariate line-scatter plots. ParaDRAM - NOTE: pmpd.restartList{1}.plot.covmat2.make() % to make proposal covariance evolution 2D plots. ParaDRAM - NOTE: pmpd.restartList{1}.plot.covmat3.make() % to make proposal covariance evolution 3D plots. ParaDRAM - NOTE: pmpd.restartList{1}.plot.cormat2.make() % to make proposal correlation evolution 2D plots. ParaDRAM - NOTE: pmpd.restartList{1}.plot.cormat3.make() % to make proposal correlation evolution 3D plots. ParaDRAM - NOTE: ParaDRAM - NOTE: For more information and examples on the usage, visit: ParaDRAM - NOTE: ParaDRAM - NOTE: https://www.cdslab.org/paramonte
Note that these visualization tools are not unique to this particular method of the ParaDRAM sampler or other ParaMonte samplers. For the sake of illustration however, we will create plots using the above dataset read via readrestart() method of the ParaDRAM routine.
restart = pmpd.restartList{1};
restart.plot.line.axes.kws.xscale = "log";
restart.plot.line.make();
Be default, the visualization tools are loaded with a set of predefined settings. For example, ParaMonte visualizations are by default colored. These however, can be readily changed. For example, to change the colormap,
restart.plot.line.colormap.values
ans = "winter"
restart.plot.line.colormap.values = autumn;
restart.plot.line.make()
More importantly, to visualize the evolution of the proposal distribution of the sampler in 2D via its covariance matrix, one can try,
restart.plot.covmat2.make();
Note that the covariance matrix of the proposal distribution is 4-dimensional in this example here since the target density function which has been sampled is 4 dimensional. However, only two dimensions out of the four dimensions of the covariance matrix can be visualized in a single plot. By default, the first two dimensions are selected to be visualized. However, this can be readily changed. For example, to visualize dimensions 2 and 4 of the covariance metrices of the proposal distribution,
restart.plot.covmat2.dimensionPair
ans = 1×2
1 2
restart.plot.covmat2.dimensionPair = [ 2 4 ];
restart.plot.covmat2.make();
By default, the covariance matrices are centered at the mean of the sampled stated by the adaptive MCMC sampler up to the stage when the covariance matrix was computed in the simulation. Sometimes, it helps to fix the centers of all covariance matrices all at the origin of the plot so that one could better focus on the variations in the shape and scale of the the covariance matrices. This can easily done,
restart.plot.covmat2.centerColumn
ans = 1×1 cell array {'meanVec'}
restart.plot.covmat2.centerColumn = []; % place all covariance matrices at the origin of the plot.
restart.plot.covmat2.make();
The above plot is likely cluttered with too many ellipsoids. The number of ellipsoids can be reduced by only plotting a select representative number of ellipsoids form the restart file. Since much of the proposal adapations happen at the beginning of the simulation, a simple jump through the covariance matrices will likely not result in a good display,
restart.plot.covmat2.rows = 1:50:restart.count;
restart.plot.covmat2.make();
A better approach would be to jump through the covariance matrices uniformly in the loarithmic space,
restart.plot.covmat2.rows = restart.plot.covmat2.getLogLinSpace ( 1.2 ... this is the base of the logarithm
, 0.7 ... this is the size of the jumps in the logarithmic space
... the start of the series goes here. The default is 1.
... the start of the series goes here. The default is the total number of covariance matrices.
);
restart.plot.covmat2.make();
To reset the properties of the plot object to the default settings, try,
restart.plot.covmat2.reset();
ParaDRAM - NOTE: resetting the properties of the covmat2 plot...
To reset the entire line object including reading the data again from the input dataFrame, try,
restart.plot.covmat2.reset("hard");
ParaDRAM - NOTE: creating the covmat2 plot object from scratch...

Making 3D ellipsoid plots

If the 2D visualization is the optimal display one would wish to get, a 3D visualization may be more informative and illustrative,
restart.plot.covmat3.rows = restart.plot.covmat3.getLogLinSpace ( 1.3 ... this is the base of the logarithm
, 1 ... this is the size of the jumps in the logarithmic space
... the start of the series goes here. The default is 1.
... the start of the series goes here. The default is the total number of covariance matrices.
);
restart.plot.covmat3.make("dimensionPair", [1 2]);
Fixing the centers of the ellipsoids may also be informative,
restart.plot.covmat3.make("centerColumn", [], "dimensionPair", [1 2]);
As we see, the proposal converges toward a final shape and scale although the convergence seems to have fully occurred even toward the end of the simulation. This implies that we should have run this simulation for longer times.

Changing the properties of the colorbar

Similarly, to change the properties of the colorbar, try,
restart.plot.covmat3.colorbar.kws
ans = struct with fields: fontSize: 11
restart.plot.covmat3.colorbar.kws.fontSize = 13;
To change properties that do not exist, simple add them to the kws component, for example,
restart.plot.covmat3.colorbar.kws.location = "northoutside";
restart.plot.covmat3.make()
Remember that a handle to all objects in the plot is also stored in the currentFig component of the object. Most of the properties of the figure, axes, and the plots can be also changed directly via these handles. For example, to change the colorbar label, we could try,
restart.plot.covmat3.currentFig.colorbar.Label.String
ans = 'Adaptive Update Count'
restart.plot.covmat3.reset();
ParaDRAM - NOTE: resetting the properties of the covmat3 plot...
restart.plot.covmat3.make()
restart.plot.covmat3.currentFig.colorbar.Label.FontSize = 11;
restart.plot.covmat3.currentFig.colorbar.Label.Interpreter = "tex"; % set the interpreter for the colorbar
restart.plot.covmat3.currentFig.colorbar.Label.String = "The Number of Adaptations to the Proposal Distribution";

Visualizing the evolution of the correlation matrices of the proposal distribution of the adaptive MCMC samplers

All of the above tools and explanations equally apply to the visualization of the correlation matrices of the proposal distributions. The only difference is that the corresponding plotting components are named cormat2 and cormat3, as opposed to covmat2 and covmat3 for the visualization of the covariance matricies explained in the above.
restart.plot.cormat2.make("dimensionPair", [1 3]);
or, in 3D,
restart.plot.cormat3.reset("hard");
ParaDRAM - NOTE: creating the cormat3 plot object from scratch...
restart.plot.cormat3.make("dimensionPair", [2 4], "rows", restart.plot.cormat3.getLogLinSpace(1.3,0.7) );

Exporting figures to external files

To extract a figure to an external PNG file, try,
restart.plot.cormat3.exportFig("exportedFigure.png","-m4")
The above command will extract the current active figure to an output file with the relatively high resoluton as specified by the flag -m4. To make the exported figure smaller, one could specify -m2 instead. In addition, to generate a figures with background transparency, the flag -transparent can be added to the exportFig function call,
% restart.plot.cormat3.exportFig("exportedFigure.png","-m2 -transparent") % uncomment to export the figure with transparency
Final Note:
To see other more sophisticated types of plots that can be automatically made with the ParaMonte visualization tools, visit: https://www.cdslab.org/paramonte/notes/examples/matlab/mlx/

{% comment %}
{% endcomment %}
web counter