Category Archives: mathematica

The most worthless subway stations in New York, part ii

In the last post, I demonstrated loading a GIS shapefile for New York City into Mathematica, loading the official subway station entry data from the city and overlaying it onto the map.

Below is the code to prepare and display an interactive version of the tool that also identifies and displays the stations on each line closest to each other and most remote.

preStationsForComp = Drop[stations, 1][[ All, {3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 26, 25}]]; preStationsForComp[[All, 13]] = preStationsForComp[[All, 13]]/1000000.; preStationsForComp[[All, 14]] = preStationsForComp[[All, 14]]/1000000.;


stationsForComp =

  Union[Map[{#[[1]], #[[2]], #[[3]], #[[4]], #[[5]], #[[6]], #[[7]], #[[

       8]], #[[9]], #[[10]], #[[11]], #[[12]], {#[[13]], #[[14]]}} &,

    preStationsForComp],

   SameTest -> ((#1[[1]] == #2[[1]]) && (#1[[2]] == #2[[2]]) && (#1[[

          3]] == #2[[3]]) && (#1[[4]] == #2[[4]]) && (#1[[5]] == #2[[

          5]]) && (#1[[6]] == #2[[6]]) && (#1[[7]] == #2[[7]]) && (#1[[

          8]] == #2[[8]]) && (#1[[9]] == #2[[9]]) && (#1[[10]] == #2[[

          10]]) && (#1[[11]] == #2[[11]]) && (#1[[12]] == #2[[12]]) &)];

(* one randomly selected entrance per station/line *)
entranceFromLine[line_] :=

 Select[stationsForComp, (#[[2]] == line || #[[3]] == line || #[[4]] ==

       line || #[[5]] == line || #[[6]] == line || #[[7]] == line || #[[8]] ==

        line || #[[9]] == line || #[[10]] == line || #[[11]] ==

       line || #[[12]] == line) &][[All, {1, 13}]]
entranceFromSimplifiedData[line_, station_] :=

 Select[stationsForComp, (#[[1]] ==

      station && (#[[2]] == line || #[[3]] == line || #[[4]] ==

        line || #[[5]] == line || #[[6]] == line || #[[7]] ==

        line || #[[8]] == line || #[[9]] == line || #[[10]] ==

        line || #[[11]] == line || #[[12]] == line)) &]

Manipulate[ Module[{key, closestations, farstations, lineEntrances, nearstationPair, farstationPair, closestToPointer}, key = Select[index, #[[2]] == whichline &][[1]][[1]]; closestations = First[screenedVitalDistances[[key]][[2]]]; farstations = Last[screenedVitalDistances[[key]][[2]]]; nearstationPair = {entranceFromSimplifiedData[whichline, closestations[[1]][[1]]][[1]], entranceFromSimplifiedData[whichline, closestations[[1]][[2]]][[ 1]]}; farstationPair = {entranceFromSimplifiedData[whichline, farstations[[1]][[1]]][[1]], entranceFromSimplifiedData[whichline, farstations[[1]][[2]]][[1]]}; lineEntrances = entranceFromLine[whichline]; closestToPointer = First[SortBy[Map[{#, GeoDistance[#[[2]], p]} &, lineEntrances], Last]]; Column[{Show[bg, Graphics[{PointSize[Medium], Blue, Table[Tooltip[Point[lineEntrances[[i]][[2]]], lineEntrances[[i]][[1]]], {i, 1, Length[lineEntrances]}]}], Graphics[{PointSize[Large], Red, {Tooltip[Point[Last[farstationPair[[1]]]], First[farstationPair[[1]]]], Tooltip[Point[Last[farstationPair[[2]]]], First[farstationPair[[2]]]]}}], Graphics[{PointSize[Medium], Orange, {Tooltip[Point[Last[nearstationPair[[1]]]], First[nearstationPair[[1]]]], Tooltip[Point[Last[nearstationPair[[2]]]], First[nearstationPair[[2]]]]}}], Graphics[{PointSize[Large], Black, {Tooltip[Point[Last[closestToPointer[[1]]]], First[closestToPointer[[1]]]]}}], PlotRange -> {{-74.05(*w*), -73.69(*e*)}, {40.54(*s*), 40.92(*n*)}}, ImageSize -> 340, ImagePadding -> 2], Style[TableForm[{closestations, farstations, {closestToPointer[[1]][[1]], closestToPointer[[2]]}}, TableHeadings -> {{"closest pair", "farthest pair", "closest to pointer"}, {"stations", "distance (m)"}}], FontFamily -> "Times"]}, Center, 3]], {{whichline, "F", "which line?"}, index[[All, 2]]}, {{p, {-73.87, 40.73}}, Locator}]

In our next installment, what are the most distant stations in the whole system?

What are the most worthless subway stations in New York?

I was recently riding on the subway and it ocurred to me that some stations are much, much closer together than is typical. You can ride 30 blocks from West 42nd Street to West 72nd in a straight shot on the 2 or 3 lines, or 39 blocks between East 86th Street to East 125th Street on the 4 or the 5, but then sometimes you’re stuck mosying along between 110th and 116th, or between 14th and 18th. I heard a comedian make a joke about the pointlessness of stations four blocks apart, years ago. Nobody laughed, but I think he was onto something — some of these stops are very close together.

But how close? Scads of GIS files about New York City are freely available, and Mathematica has had the native ability to deal with these since version 7, though I’ve never used these functions. Let’s see what we can find out about the New York City Subways.

I used borough outlines from http://www.baruch.cuny.edu/geoportal/data/nymag/ and subway entrance locations from http://mta.info/developers/sbwy_entrance.html. I found three potentially tricky elements to the calculation \[LongDash] first, the subway entrances are not necessarily centered around their respective subway platforms; I chose a single entrance arbitrarily for each station but if I chose an outlying one, it will introduce a small error in my result. Second, the subway entrace locations are in {latitude,longitude} format (actually, {latitude,longitude}*1,000,000), and this is the format expected by Mathematica’s GeoDistance[] function, but the basemap is formated in {longitude, latitude} format, so we have to reverse the order of the elements in each location when we switch between mapping and determining distances. Third, the city’s database of subway entrances identifies an entrance as pertaining to a given line even if it is connected to that line only by an underground pedestrian tunnel. Since we care about what the trains are doing only, these have to be screened out by hand.

stations = Import[NotebookDirectory[] <> "StationEntrances.csv"];

allLines = Union[Flatten[Drop[stations, 1][[All, Range[4, 14]]]]]; allLines = Select[allLines, # != "" &]; index = Table[{i, allLines[[i]]}, {i, 1, Length[allLines]}];
This function lets us extract all subway stations pertaining to a given line.

matchLine[line_] := Union[Select[ Drop[stations, 1], #[[4]] == line || #[[5]] == line || #[[6]] == line || #[[7]] == line || #[[8]] == line || #[[9]] == line || #[[10]] == line || #[[11]] == line || #[[12]] == line || #[[13]] == line || #[[14]] == line &], SameTest -> (#1[[3]] == #2[[3]] &)]

byLineUnique = Map[matchLine[#] &, allLines];

Let’s graph it to see if this looks reasonable. Note that I specify the PlotRange to eliminate Staten Island, which has no subways.
bg = Import[NotebookDirectory[] <> "nymag_nyc_geog/nyc_pumas_2008.shp"];

Show[bg, Graphics[{PointSize[Medium], Red, Point@(Map[Reverse, Drop[stations[[All, {25, 26}]], 1]/1000000.])}], PlotRange -> {{-74.05(*w*), -73.69(*e*)}, {40.54(*s*), 40.92(*n*)}}]

Subway entrances as rendered by Mathematica

Looks good to me. We could easily graph each line in a difference color, connect the dots with lines, etc., but this will do for now.

The station entrance data includes all sorts of things we don’t care about, so let’s simplify it.

vitalTable = Table[{index[[i]][[2]], Map[{#[[3]], {#[[26]], #[[25]]}/10^6} &, byLineUnique[[i]]] // N}, {i, 1, Length[byLineUnique]}];

For the moment I don’t care about the order the stations are in, I’m going to check every station against every other one.

vitalDistances = Table[{index[[i]][[2]], SortBy[Union[ Map[{#[[All, 1]], Round[GeoDistance[Reverse[#[[1]][[2]]], Reverse[#[[2]][[2]]]], .1]} &, Permutations[vitalTable[[i]][[2]], {2}]]], Last]}, {i, 1, Length[byLineUnique]}];

Below is a list of subway stations combinations that aren’t traversed by the trains themselves and therefore shouldn’t be counted as “too close”. We’ll screen these out.

disallowedCombos = {{"14 St", "6 Av"}, {"South Ferry", "Whitehall St-South Ferry"}, {"Chambers St", "Park Place"}, {"Atlantic Av", "Atlantic Av-Pacific St"}, {"Brooklyn Bridge-City Hall", "Chambers St"}, {"Franklin Av", "Botanic Garden"}, {"Botanic Garden", "Franklin Av"}, {"59 St", "Lexington Av/59 St"}, {"51 St", "Lexington Av/53 St"}, {"74 St-Broadway", "Jackson Heights-Roosevelt Av"}, {"14 St", "8 Av"}, {"62 St", "New Utrecht Av"}, {"Park Place", "World Trade Center"}, {"42 St-Bryant Pk", "5 Av"}, {"Lorimer St", "Metropolitan Av"}, {"Court Sq", "Court Sq-23 St"}, {"42 St-Port Authority Bus Terminal", "Times Sq-42 St"}, {"Chambers St", "World Trade Center"}, {"Borough Hall", "Court St"}};


screenedVitalDistances =

  Table[{index[[i]][[2]],

    Select[SortBy[

      Union[vitalDistances[[i]][[2]],

       SameTest -> (#1[[1]] == Reverse[#2[[1]]] &)],

      Last], ! MemberQ[disallowedCombos, #[[1]]] &]}, {i, 1,

    Length[byLineUnique]}];

Map[{#[[1]], First[#[[2]]][[1]], First[#[[2]]][[2]], Last[#[[2]]][[1]], Last[#[[2]]][[2]]} &, screenedVitalDistances]

The stations closest to each other and farthest from each other for each subway line in New York.

In the next post, I will demonstrate an interactive tool (which I have also uploaded to the Wolfram Demonstrations Project) that shows all the stations for a given line, marking the closest ones in green and the farthest ones in red, with tooltips identifying every station.

A perfect solution to 24s

In several recent posts, I have discussed a method for perfect solution to the game of 24s. Here, I summarize some of those results and link to the Mathematica notebook I used to find them, and a comprehensive list of solutions for those who would like to skip the math and get right to the answer.

To recap, there are 715 possible hands in a game of 24s. 566 of those hands can be solved with arithmetics alone, 595 if we add roots, powers, and logs, and 619 if we add Mod.

I generally play with Log and Power and Root allowed, but not Mod. I find that the hands that are solvable only with Log, Power, and Root, generally are solvable by humans. See for example

{1,1,8,9} -> 8 Root[9,1+1]
{1,2,9,9} -> (9-1) Root[9,2]
{2, 2, 2, 6} -> 2^(2+Log[2,6]) or (2*Root[6,2])^2
{2,2,9,9} -> 2(Root[9,2]+9)
{3,7,8,10} -> 7 Root[8,3] +10
{3,4,9,10} -> 4+Log[3,9]*10
{7, 8, 9, 9} -> 8 Root[9,9-7], and
{8, 8, 9, 10} -> 8 Root[9,10-8].

A bright high school student should be able to find any of those. However, Mod introduces solutions that would probably never be played by an unaided human being. For example,

{5, 6, 7, 10} -> 6*Mod[7^10,5], and

{5,7,7,8} -> 8*Mod[7^7,5].

There is no other solution in these two cases.

Anyway, I’ve provided a comprehensive list of solutions linked as RTF and PDF documents, and the Mathematica notebook that I used to solve the problem in the first place.

All 24s in PDF.

Again, but as an RTF file.

And, most importantly, the Mathematica notebook.

Let me know if you do anything interesting with this.

Solving 24s, part iii: mod

I have shown how to comprehensively solve a game of 24s that allows just arithmetic, and how to expand that to allow powers, roots, and logarithms. Now we’re going to add the modulo (Mod) function.

If you’ve followed the methodology in parts i and ii, this is pretty simple stuff.

First, to our set of simplified function definitions, I add
mod[x_Integer?(# > 0 &), y_Integer?(-16 <= # <= 16 && # != 0 &)] := Mod[x, y]; mod[x__] := Indeterminate;

Then, our rule definition step looks like
rules = generateRules[{Times, Plus, Subtract, divide, power, root, log, mod}, 4];

We're up to 512 rules here, once redundant patterns are eliminated. The complexity increases exponentially with the number of operators allowed.

The good news is that Mod makes lots of hands solvable that wouldn't be otherwise. See, for example,

{3, 5, 5, 10} -> 5*5-Mod[10,3]
{5, 6, 7, 10} -> 6*Mod[7^10,5]
{5, 7, 7, 8} -> Mod[7^7,5]*8, and
{6, 7, 7, 9} -> 6*Mod[7*7,9]

In fact, allowing Mod as an operator increases the total number of solvable hands all the way to 619 out of 715.

The unsolvable hands that remain are
{{1, 1, 1, 1}, {1, 1, 1, 2}, {1, 1, 1, 3}, {1, 1, 1, 4}, {1, 1, 1, 6}, {1, 1, 1, 7}, {1, 1, 1, 9}, {1, 1, 1, 10}, {1, 1, 2, 2}, {1, 1, 2, 3}, {1, 1, 5, 9}, {1, 1, 5, 10}, {1, 1, 6, 7}, {1, 1, 6, 10}, {1, 1, 7, 7}, {1, 1, 7, 8}, {1, 1, 7, 9}, {1, 1, 8, 10}, {1, 1, 9, 9}, {1, 1, 9, 10}, {1, 1, 10, 10}, {1, 2, 2, 2}, {1, 2, 9, 10}, {1, 2, 10, 10}, {1, 4, 9, 9}, {1, 5, 7, 7}, {1, 6, 6, 7}, {1, 6, 7, 7}, {1, 6, 7, 8}, {1, 6, 10, 10}, {1, 7, 7, 7}, {1, 7, 7, 8}, {1, 7, 10, 10}, {1, 8, 9, 9}, {1, 8, 9, 10}, {1, 8, 10, 10}, {1, 9, 9, 9}, {1, 9, 9, 10}, {1, 9, 10, 10}, {1, 10, 10, 10}, {2, 2, 2, 2}, {2, 2, 7, 9}, {2, 6, 7, 7}, {2, 7, 7, 7}, {2, 7, 7, 9}, {2, 7, 9, 9}, {2, 9, 9, 9}, {2, 9, 9, 10}, {2, 10, 10, 10}, {3, 3, 10, 10}, {3, 5, 7, 7}, {3, 10, 10, 10}, {4, 4, 9, 9}, {4, 7, 7, 9}, {4, 7, 7, 10}, {4, 9, 9, 9}, {5, 5, 5, 7}, {5, 5, 5, 8}, {5, 5, 5, 10}, {5, 5, 6, 10}, {5, 5, 7, 9}, {5, 7, 7, 7}, {5, 7, 9, 9}, {5, 8, 10, 10}, {5, 9, 9, 9}, {5, 10, 10, 10}, {6, 6, 6, 7}, {6, 6, 7, 7}, {6, 6, 7, 8}, {6, 7, 7, 7}, {6, 7, 7, 8}, {6, 9, 10, 10}, {7, 7, 7, 7}, {7, 7, 7, 8}, {7, 7, 7, 9}, {7, 7, 7, 10}, {7, 7, 8, 8}, {7, 7, 8, 9}, {7, 7, 9, 9}, {7, 8, 8, 8}, {7, 9, 9, 9}, {7, 9, 9, 10}, {7, 9, 10, 10}, {7, 10, 10, 10}, {8, 8, 8, 8}, {8, 8, 9, 9}, {8, 8, 10, 10}, {8, 9, 9, 9}, {8, 9, 9, 10}, {8, 9, 10, 10}, {8, 10, 10, 10}, {9, 9, 9, 9}, {9, 9, 9, 10}, {9, 9, 10, 10}, {9, 10, 10, 10}, {10, 10, 10, 10}}

Solving 24s, part ii: powers, roots, and logarithms

In my last post on this topic, I showed that 566 of 715 possible hands in the game of 24s could be solved with simple arithmetic. However in practice people don’t play with just those operators. In my experience, a normal game always includes powers, roots, and logarithms. The solution method used for arithmetics works just fine when expanded for these additional functions, though I chose to make some small modifications for the sake of efficiency.

First, I redefine the functions to remove some cases that are quite unlikely to produce 24s, and which would increase memory needs and processing time significantly if left in place. See the following, for example:

Off[General::"spell1"]; Off[General::"spell"]; power[x_?(# > 0 &), y_?(-16 <= # <= 16 &)] := x^y; power[x__] := Indeterminate; root[x_Integer?(# > 0 &), y_Integer?(-16 <= # <= 16 && # != 0 &)] := Power[x, (y)^-1]; root[x__] := Indeterminate; divide[x_, y_] /; y != 0 := Divide[x, y]; divide[x_, 0] := Indeterminate; log[x_?(# > 0 &), y_?(# > 0 &)] := Log[x, y]; log[x__] := Indeterminate;

We can then generate rules as in the arithmetic case but with

rules = generateRules[{Times, Plus, Subtract, divide, power, root, log}, 4];

and run legitCombos as before. The result? 595 hands are solvable, as opposed to the 566 we had before. hands that can be solved now, that couldn’t be solved before, include

{1,1,8,9} -> 8 Root[9,1+1] and trivial variations
{1,2,9,9} -> (9-1) Root[9,2] and trivial variations
{2, 2, 2, 6} -> 2^(2+Log[2,6]) or (2*Root[6,2])^2 and trivial variations
{2,2,9,9} -> 2(Root[9,2]+9) and trivial variations
{3,7,8,10} -> 7 Root[8,3] +10 and trivial variations
{3,4,9,10} -> 4+Log[3,9]*10 and trivial variations
{7, 8, 9, 9} -> 8 Root[9,9-7] and trivial variations, and
{8, 8, 9, 10} -> 8 Root[9,10-8] and trivial variations.

The hands that remain unsolvable are

{{1, 1, 1, 1}, {1, 1, 1, 2}, {1, 1, 1, 3}, {1, 1, 1, 4}, {1, 1, 1, 6}, {1, 1, 1, 7}, {1, 1, 1, 9}, {1, 1, 1, 10}, {1, 1, 2, 2}, {1, 1, 2, 3}, {1, 1, 5, 9}, {1, 1, 5, 10}, {1, 1, 6, 7}, {1, 1, 6, 10}, {1, 1, 7, 7}, {1, 1, 7, 8}, {1, 1, 7, 9}, {1, 1, 8, 10}, {1, 1, 9, 9}, {1, 1, 9, 10}, {1, 1, 10, 10}, {1, 2, 2, 2}, {1, 2, 9, 10}, {1, 2, 10, 10}, {1, 4, 9, 9}, {1, 5, 7, 7}, {1, 6, 6, 7}, {1, 6, 7, 7}, {1, 6, 7, 8}, {1, 6, 10, 10}, {1, 7, 7, 7}, {1, 7, 7, 8}, {1, 7, 10, 10}, {1, 8, 9, 9}, {1, 8, 9, 10}, {1, 8, 10, 10}, {1, 9, 9, 9}, {1, 9, 9, 10}, {1, 9, 10, 10}, {1, 10, 10, 10}, {2, 2, 2, 2}, {2, 2, 7, 9}, {2, 6, 7, 7}, {2, 7, 7, 7}, {2, 7, 7, 9}, {2, 7, 8, 10}, {2, 7, 9, 9}, {2, 9, 9, 9}, {2, 9, 9, 10}, {2, 10, 10, 10}, {3, 3, 4, 10}, {3, 3, 10, 10}, {3, 4, 6, 7}, {3, 5, 5, 10}, {3, 5, 7, 7}, {3, 5, 8, 10}, {3, 10, 10, 10}, {4, 4, 5, 9}, {4, 4, 6, 6}, {4, 4, 6, 7}, {4, 4, 9, 9}, {4, 4, 9, 10}, {4, 7, 7, 9}, {4, 7, 7, 10}, {4, 9, 9, 9}, {4, 9, 10, 10}, {4, 10, 10, 10}, {5, 5, 5, 7}, {5, 5, 5, 8}, {5, 5, 5, 10}, {5, 5, 6, 9}, {5, 5, 6, 10}, {5, 5, 7, 9}, {5, 6, 7, 10}, {5, 7, 7, 7}, {5, 7, 7, 8}, {5, 7, 9, 9}, {5, 8, 9, 9}, {5, 8, 10, 10}, {5, 9, 9, 9}, {5, 9, 9, 10}, {5, 10, 10, 10}, {6, 6, 6, 7}, {6, 6, 7, 7}, {6, 6, 7, 8}, {6, 6, 10, 10}, {6, 7, 7, 7}, {6, 7, 7, 8}, {6, 7, 7, 9}, {6, 7, 8, 8}, {6, 8, 10, 10}, {6, 9, 9, 9}, {6, 9, 10, 10}, {7, 7, 7, 7}, {7, 7, 7, 8}, {7, 7, 7, 9}, {7, 7, 7, 10}, {7, 7, 8, 8}, {7, 7, 8, 9}, {7, 7, 8, 10}, {7, 7, 9, 9}, {7, 7, 10, 10}, {7, 8, 8, 8}, {7, 9, 9, 9}, {7, 9, 9, 10}, {7, 9, 10, 10}, {7, 10, 10, 10}, {8, 8, 8, 8}, {8, 8, 8, 9}, {8, 8, 9, 9}, {8, 8, 10, 10}, {8, 9, 9, 9}, {8, 9, 9, 10}, {8, 9, 10, 10}, {8, 10, 10, 10}, {9, 9, 9, 9}, {9, 9, 9, 10}, {9, 9, 10, 10}, {9, 10, 10, 10}, {10, 10, 10, 10}}.

Next chapter, we add Mod[].

monkeywrench

Mathematica in Finance, mostly, I expect