If you are Pure Storage customer and you recently logged into Pure1 (which is our cloud-based management/analytics/support portal), you might have noticed a notification the first time you logged in this week:
What is this? Well this is the great thing about Pure1–we just add features all of the time and this is an exciting one. Two years or so ago I wrote this blog post:
That went into storage troubleshooting with ESXi. Got a lot of positive feedback on that and also a lot of questions–it was a situation where people definitely needed more helpful tools to identify performance issues, or simply just to track who was doing what and where.
This week we introduced what we call “VM Analytics” to Pure1. All customers get it for free.
How does it work? Well, first, you configure a collector. There are two options for this. We can either run the collector inside of a FlashArray (it runs inside of our Purity/Run platform as a docker container) or you can instantiate the container on docker yourself as an off array collector. The collector then is authorized to one or more vCenters (read-only creds are fine) and pulls in your VM topologies and metrics. It pulls in:
- Virtual Disks
From a metric standpoint, it pulls in the relevant metrics for VMs, Datastores, Virtual disks, and hosts:
- IOPS (R and W)
- Throughput (R and W)
- Latency (R and W)
Each vCenter is polled every 10 minutes and then this information then gets dialed home back to Pure1 hourly. Inside of Pure1, you can now see you VMware topology. Sort VMs or their disks by latency, IOPS etc. What VM is where? What datastore is on what FlashArray? What are the latency differences through the stack?
So before we dive into how you use it, let’s go through setting it up.
As I mentioned there are two methods for the collector on array or off array. For on array, just simply open a Pure support case and ask for it. What are the requirements?
Purity 4.10.x (preferably latest)
M20 or higher
Fairly straight forward. If you do not have at least one array that matches those requirements, fear not–the off array collector is for you. Let’s walk through setup.
On-Array Collector vs Off-Array
Personally I am a fan of the off array collector–it is super easy to setup, you just need a vanilla Ubuntu VM and the collector takes only a few minutes to setup at most. And you can do it entirely yourself, no need to call support.
I would recommend the on-array in two scenarios:
- Your network cannot route out. In this case your Ubuntu VM cannot talk to the internet and cannot download what you need for the collector.
- You cant/dont use Ubuntu.
Install the On Array Collector
Call Pure support–they do all the work for you.
Default credentials are pureuser/pureuser. You can find the connection IP to SSH to by logging into your FlashArray GUI:
Install the Off Array Collector
For a VERY DETAILED step-by-step process go see this blog post
If you know how to install/configure Ubuntu this post should do just fine, so continue on.
NOTE: if you had support configure the on array collector you can skip all of this and go to the section on Configuring the collector.
The step is to login to pure1.
Click on VM Topology then the gear icon that says “Collector configuration” when you hover over it.
In the pop-up that appears, click Create Collector and enter a name for your collector.
There are no real requirements on this name, this is just a friendly name for you to use as a reference.
Click copy and save this for later.
The next step is to configure my collector. In this case, I have a Ubuntu 16.04.1 VM:
A pretty bare bones Ubuntu installation at this point. The following are the requirements:
x86_64 cpu with at least 2 cores Minimum 2GB RAM
Docker EE 1.11.2 or newer, or CE 17.03 or newer
python 3.5.2 or later
logrotate, and cron installed and running
I have python 3.5.2 already:
If you do not have it installed, install it. I won’t cover it here but there are plenty of blogs out there that walk through that–it is pretty straight forward.
Next to install Docker–I will go with CE. I will also not walk through this, because there is a great and accurate resource here:
This walks through installing it. Follow ALL of the instructions (step 1-4 under SET UP THE REPOSITORY and step 1 and 2 under INSTALL DOCKER CE).
Do not skip the apt-get updates (there are two). Configure the repository and then install Docker. Just run the commands listed in that page. Takes a minute or so.
When you are done you should have Docker ready to go:
Now install the collector. Grab that script you copied from Pure1. Mine is:
env COLLECTOR_AUTH_KEY=dxxxxxxxxxxxxxxxxxx8 bash -c "$(wget -O - https://static.pure1.purestorage.com/vm-analytics-collector/install.sh)"
Run that now. You will likely have to add sudo to it. If you forget, it will remind you.
This will not take long (10-20 seconds). It will download the necessary files and install the collector.
Configure the Collector
If you are using the off-array collector, just SSH into that VM. If you are using the on-array collector, SSH into the Docker VM support configured for you.
Now authenticate with vCenter! I will add three of mine using the purevmanalytics command:
You should add sudo here too likely.
sudo purevmanalytics connect --hostname <hostname>--username <username> --password <password>
You can review the status with purevmanalytics list:
You’re done! You should see data starting to populate in Pure1 in an hour or so.
The collector puts its logs here:
You can look at the phonehome status in the ccm log and general collection information in the vmanalytics.log file.
Also you can check in Pure1 for the last time the collector has phoned home:
Note that it might take a few hours for the first metrics to show up. Even if you force a phonehome. If the logs have not rolled over yet, nothing will be sent.
Some basic questions:
Do I need many collectors?
No usually. As long as the collector can reach all of your vCenters, you do not need to have more than one. If the networks are firewalled, then you might though if a given collector and not reach a certain vCenter.
Do I need to authenticate my FlashArrays?
No. They dial home themselves, Pure1 already has their information/metrics.
What are the network requirements?
Refer to this:
Can I force a phone home?
Yes for the off-array collector only though. purevmanalytics phonehome.
What FlashArray models does it support?
While the collector can only run on certain FlashArray models–all support FlashArrays are supported with the feature itself. So 400 series, //m and //x. They will all show up and be available in the Pure1 topology.
vCenter 5.5 and later.
Does it support VVols? RDMs? VMFS?
Today this supports VMFS only. VVols and RDMs are coming.
Not today (2018). This is VMware only at this point, but it will change.
In the next post, I will go over using it and what you can see in Pure1.