Adding columns to a data table
You can add columns to your current data table to create a join between data from different sources and merge it to a single data table. Data can be added from a local file, a data file previously saved in the Spotfire library (using the Windows client), or, you can add data directly from Google Analytics or from Salesforce, if you have access to those systems.
Even though data from different data tables often can be viewed in the same visualization, it may be preferred to actually incorporate similar data into a single data table. When multiple data tables are used in one visualization, the join of the two tables is done after aggregation has been performed. Therefore, if you want to perform joining of tables before any aggregations, or, if you want to create certain calculated columns based on data from two sources, you might want to insert data from one table to the other.
When you add data into your analysis, you can select to add the data as new columns in another data table, from the final step in the add data workflow:
You can also insert columns into a specific place in a data table, as described below.
On the authoring bar, click
Make sure the data table of interest is selected in the upper-left drop-down list.
- In the data table structure, click the plus sign between the nodes where you want to add columns, and select Add columns.
- Select data to insert columns from.
- In the flyout, click Settings for added columns.
Add columns – match columns dialog, verify that the column match suggested by the recommendation engine is the way you want it, or configure the matching as desired. You can add or remove matches, or edit an existing match. To edit a match, click on the row with the match and select one column
From original data and one
From new data. Click on the x on a row to remove a match.
Tip: When adding columns, matching is done to enable joining between two data sets. Typically, only one or a few columns containing row identifiers should be used in the match.
- To the right in the dialog, you see a preview of the data, using the current settings. You can change the Number of input rows to include more rows (from both input tables) in the sample calculation. This may increase the accuracy of the shown result. However, using too many rows can decrease the performance of the preview.
Optionally, if you do not wish to add all available columns from the added data, as is the default, you can select to skip columns under
Columns from new data by clearing the check box for the undesired columns.
Note: If the data source from which data was added can be reloaded, and new columns become available after a reload, then the new columns will automatically become included. If this is not desired, you may want to exclude new columns after reloading data by editing the settings for added columns.
- Review the Join settings. Look at the interactive Join example, at the bottom of the Join settings field, for information about how rows will be matched using the different join types. You can specify whether or not to match on empty values using the Treat empty values as equal check box.
- When you are done, click OK.
Example: Adding columns from Salesforce
In this example, data has already been opened from a local Excel file. In the Data canvas, select the data table of interest, determine where you want to add columns (between the nodes in the graphical structure or between transformation groups), and click Add columns:
Choose a new or a predefined connection to Salesforce and sign in. Select the data to add and click OK. In the Add columns – match columns dialog, verify that the suggested column matching is valid, or add or remove matches between columns From original data and From new data. When you are satisfied with the data selection and settings, click OK to close the flyout and add the data.