This document illustrates how to link Markit Securities Finance data with CRSP data
Brief Overview of Markit Securities Finance Data
Markit RED (RED) is the market standard for reference data in the credit markets. RED provides a unique 6-digit identifier, redcode
, for each entity in the database. In addition, it also carries several other common entity identifiers, such as 6-digit entity_cusip
, ticker
as well as company name strings.
Researchers can use these identifiers to link the CDS data with other data sources, such as CRSP and TRACE for equity and fixed income respectively.
Identifiers within MSF
MSF data does not carry separate table containing security description information. Instead, all identifying information are reported daily together with the time series data.
MSF Analytics
For the case of MSF Analytics data, the identifiers are
- DXLID- MSF unique identifier, known as DataExplorer ID as MSF was previously an independent data company DataExplorer
- ISIN
- SEDOL
- CUSIP - 9 digit historical CUSIP
- QUICK - Instrument identifier used in Japan, empty for all non-Japanese securities
- InstrumentName - Security description
- MarketArea - Market classification of security defining the country or currency of trading, security type
Below is a sample security description field from MSF Analytics American Equity 2022 data.
DX00000021 | US98956P1021 | 2783815 | 98956P102 | Zimmer Biomet Holdings Inc | US Equity (S&P500) |
---|---|---|---|---|---|
DX00000023 | US7901481009 | 2768663 | 790148100 | St Joe Co | US Equity (RUSSELL 2000) |
DX00000026 | US6934751057 | 2692665 | 693475105 | Pnc Financial Services Group Inc | US Equity (S&P500) |
MSF Institutional
MFS Institutional carries very similar identifier information:
- DXLID - MSF unique identifier
- ISIN
- SEDOL
- CUSIP - 9 digit historical CUSIP
- Quick - Instrument identifier used in Japan, empty for all non-Japanese securities
- StockDesc - Stock Description
- MarketArea - Market Area (Country level)
- BBGID - Bloomberg Global Identifier (FIGI), available starting in 2017
- BB_TICKER - Bloomberg Ticker, available starting in 2017
Below is a sample security description field from MSF Institutional American Equity 2022 data.
DXLID | ISIN | SEDOL | CUSIP | StockDesc | MarketArea | BBGID | BB_TICKER |
---|---|---|---|---|---|---|---|
21 | US98956P1021 | 2783815 | 98956P102 | Zimmer Biomet Holdings Inc |
USA Equity |
BBG000BKPMZ8 | ZBH UN |
23 | US7901481009 | 2768663 | 790148100 | St Joe Co |
USA Equity |
BBG000D6VGB9 | JOE UN |
26 | US6934751057 | 2692665 | 693475105 | Pnc Financial Services Group Inc |
USA Equity |
BBG000BRD2F2 | PNC UN |
Notice that though both MSF Analytics and Intuitional data carry MSF unique identifier DXLID
at instrument level, but the format of the variable in respective database is slight different.
Take the security Zimmer Biomet Holdings Inc for instance, in MSF Analytics its DXLID
= DX00000021, while in MSF Institutional its DXLID
= 21. It appears the two identifier systems follow the same internal ordering system, but has different representation. We just want to point this out in this section of discussion in case users would like to link across these two MSF databases.
Connecting with CRSP
We illustrate below using the new CIZ format of CRSP data. The logic is the same for the legacy SIZ format of CRSP data, just with different database syntax. And since CRSP is for North American trading equities , we focus on linking with the portion of the MSF data that is for American equities, and the primary linking key is through historical CUSIP that is common in both databases.
Extract MSF Identifiers
As MSF doesn't carry a separate identifier table, researchers need to extract the relevant information from the actual time series datasets. We illustrate below how to quickly extract the information from both the MSF Analytics and Institutional data.
Please make sure to select "SAS" in the kernel list when trying to run the code in Jupyter Notebook environment.
Jupyter Notebook: ResearchApps/LinkMSFCRSP.ipynb Download
In the SQL code above,
- We first create a crspid table that contains the CRSP identifiers permno, permco, cusip as well as the start and end date range of such combination in the database
- We then join this crspid table with the MSF identifiers through the common historical CUSIP link
- Lastly, we add the header CRSP company name to the linked database in case researchers would like to compare
The linking logic of MSF Institutional is the same as MSF Analytics, so we are not repeating the code here.
Note: the new CIZ CRSP has different name convention from the legacy SIZ, and in the new format, historical cusip's syntax is CUSIP, not NCUSIP.