First Draft of Visualization
I'm working towards a visualization of female impersonators and have produced a first, really messy visualization. That's OK, though. I like working under the principle of "shitty first drafts." (see Anne Lamotte's text here)
This is the first step I'm taking toward the Connect New York grant this summer, where I hope to "visualize the networks between [pansy] performers in [New York City's] burlesque and vaudeville theatres . . . It will also entail mapping the theatres and other venues where such performers regularly performed" (read more here). It is, of course, also part of my dissertation research.
Here it is:
Basically what I have done is that I have searched ProQuest's Entertainment Industry Magazine Archive and looked at all the articles mentioning any of the following search terms:
"fem impersonator"-- not "female impersonator" yet, see below.
"female impressionist"-- interestingly a term really only used in British outlets until 1970s in the U.S.
"dragqueen"-- I haven't looked into "drag queen" and "female impersonator" as a search term yet because it would have expanded this search by thousands of articles. I will do so in the next step of this visualization.
"in drag"-- only looked until March 1961 (arbitrarily chosen but it just ended up being too much information and beyond the scope of what I want to do).
For each of these news articles, I wrote down the artists' names that were mentioned, and whichever group/company or production that they were performing in, and linked those to each of the theatres/nightclubs/physical venues where they appeared. I created a TSV file based on this information, in the hope that I'll be able to use Gephi at some point to map this all out. For the purposes of my short-term goal, however, of visualizing this small world of performance, I then mapped, manually (don't try this at home), all of those nodes using OmniGraffle. This is why we've ended up with this mess of a visualization. But: as messy as this looks, I'm still able to grab nodes and drag them around.
Below the graph, you can see all the sources for the nodes and connections (edges).
Note that I have kept the unnamed artists and groups as an empty slot ("______") in the visualization to make sure that the lacking information is as visible as the available information. This, of course, isn't the whole truth: there is more lacking information than only those artists.
Next steps that I'd love to take:
- Obviously add the results from "drag queen" and "female impersonator" -- though I need to figure out a way to use some scraping to get the names out of these thousands of articles... I am working on a Python solution, and will post more about it here eventually.
- Add more information using other archives
- for instance, I'd LOVE to find the entire run of Broadway Brevities, a publication that "hinted at the same-sex relationships or moral lapses of prominent industrialists and entertainment stars, and, in a remarkable series with the title Nights in Fairyland, described in detail the sites of gay and lesbian mingling in Manhattan." (Will Straw) Some of the issues exist in the Internet Archive, just not the parts that I'm really interested in.
- I should also just broaden the search in ProQuest and include other newspaper sources. The problem is, to give you a preview of the problem that a search for
"female impersonator" OR "female impersonators" OR "female impersonation" OR "female impersonationsacross the databases in ProQuest provided via my CUNY access, gives over 10,000 results, which means that the server cuts me off at results past 1934—i.e. only between 1841–1934, there are 10,000 newspaper articles available (note that this includes British sources like The Stage). That means that I have to go in chunks and download all the results.
- Geotag each of these cities and venues and map out these relationships geographically
- Build a larger database, potentially with crowdsourcing capacity, so that you all can help me find more sources, mentions, etc. and that can run a d3 visualization based on the growing information in the database.