Analysis Methodology
Statistical analyses of legislation and legislators provide context for the legislative process. Of all of the 10,000+ bills pending at any given time, our unique analyses help GovTrack visitors know what is relevant and what to pay attention to.
Ideology
The Ideology Analysis compares the sponsorship and cosponsorship patterns of Members of Congress to put them on a scale roughly from liberal to conservative. Read More »
Prognosis
The Prognosis Analysis looks at the factors that help or hurt a bill’s chance of getting out of committee and being enacted. It is based on a regression model. Read More »
Leadership
The Leadership Analysis looks at who is cosponsoring whose bills to see who the legislative leaders are. It’s a little like if you scratch my back will I scratch yours? The analysis is based on Google PageRank, the algorithm Google uses to order search results. Read More »
Ideology Analysis of Members of Congress
The ideology analysis assigns a liberal–conservative score to each Member of Congress based on his or her pattern of cosponsorship.
In a nutshell, Members of Congress who cosponsor similar sets of bills will get scores close together, while Members of Congress who sponsor different sets of bills will have scores far apart. Members of Congress with similar political views will tend to cosponsor the same set of bills, or bills by the same set of authors, and inversely Members of Congress with different political views will tend to cosponsor other congressmen’s bills.
You can find this analysis on the pages for current Members of Congress.
The charts to the right plot the ideology score on the horizontal axis and the leadership score on the vertical axis. Look at the extremes. For instance, Doug Lamborn appears as the most extreme Republican in the House chart. In February 2010 The National Journal named Lamborn the most conservative member of the House.
Overview
The data that goes into this analysis is a list of who sponsored or cosponsored which bills. The process doesn’t look at the content of the bills or the party affiliation or anything else about the Members of Congress, but it is able to infer underlying behavioral patterns, some of which correspond to real-world concepts like left-right ideology.
You’ll see in the charts on the right that the ideology analysis does a good job at separating the Democrats from the Republicans, and within each party the moderates from the extremes. If you wanted to know how your representatives stood in relation to their peers ideologically, this chart is a good place to start.
We first began publishing this analysis in 2004, then calling it a political spectrum. A similar analysis by Professor Keith Poole using voting records rather than cosponsorship produces similar results: see voteview.com. (As far as we know, we were the first to apply this sort of analysis to cosponsorship behavior.)
Methodology
The statistical method behind this analysis is Principal Components Analysis, also known as dimensionality reduction. Principal Components Analysis is a statistical technique that reveals underlying patterns in data.
Here’s how it works: Form a matrix (a grid of numbers) with columns representing Members of Congress and rows also representing Members of Congress. Do this for the House and Senate separately. We include (co)sponsorship from the current and previous two Congresses, so between four and six years of data. For the Senate, you have a 100x100 table. In each cell of the table, put the number of times the senator for the row cosponsored a bill introduced by the senator for the column. Or if it's the same senator in the row and column, put in the number of bills he or she introduced. Then compute the singular value decomposition of the matrix (which is how Principal Components Analysis is often done).
Every square matrix has a singular value decomposition. The magic is in how you interpret it. The singular value decomposition takes one matrix and gives you back three: called u, s, and v-transpose. V-transpose can be interpreted as a set of scores for each Member of Congress on a new set of dimensions. The dimensions are ranked in order by how much of the original data they explain. We have found that the second dimension best corresponds with ideology. We use the scores from that dimension in our charts.
Each score is a number. It’s entirely arbitrary whether liberal or conservative is positive or negative — the original matrix is blind to actual information like that. In fact, there’s no guarantee that these numbers even have anything to do with liberal- and conversative-ness. All it tells us is how to separate Members of Congress into two groups, or more precisely how to spread them out along a spectrum in a way that explains their record of cosponsorship. But in practice it captures ideology very well.
(In the original version of this analysis called the political spectrum, the rows were Members of Congress and the columns were bills. That is, form a matrix with a 1 in each cell where the Member of Congress corresponding to the row sponsored or cosponsored the bill corresponding to the column. The change was made only to reuse the source code with the leadership analysis, which needs a member-member matrix.)
Data
The ideology scores can be found in two CSV files sponsorshipanalysis_h.txt and sponsorshipanalysis_s.txt (House and Senate) over here.
Source Code
Running this analysis is pretty simple in Python. It is literally two lines. Assuming you have the cosponsorship matrix in P:
u, s, vT = numpy.linalg.svd(P) ideology = vT[1,:]
The full source code for this analysis can be found on github.
Citation
To cite our methodology and results, we recommend either of these:
GovTrack.us. 2013. Ideology Analysis of Members of Congress. Accessed at http://www.govtrack.us/about/analysis.
Tauberer, Joshua. 2012. Observing the Unobservables in the U.S. Congress, presented at Law Via the Internet 2012, Cornell Law School, October 2012. [text | slides | video]
References
For more on how to use singular value decomposition, check out:
Wall, Rechtsteiner, and Rocha. “Singular value decomposition and principal component analysis.” in A Practical Approach to Microarray Data Analysis. D.P. Berrar, W. Dubitzky, M. Granzow, eds. pp. 91-109, Kluwer: Norwell, MA (2003). LANL LA-UR-02-4001.
Leadership Analysis of Members of Congress
A leadership score is computed for each Member of Congress by looking at how often other Members of Congress cosponsor their bills — more or less. The analysis is based on PageRank, Google’s algorithm for ranking pages on the web.
The idea behind a leadership score is that if X cosponsors Y’s bills but Y does not cosponsor X’s bills, then X is a follower relative to Y being a leader.
You can find this analysis on the pages for current Members of Congress.
The charts to the right plot the leadership score on the vertical axis and the ideology score on the horizontal axis.
There are some interesting things in this chart. There’s a distinct V-shape. Congressional leaders appear to be more extreme. There are some confounding effects to consider here. Leaders tend to be more senior members of Congress, they tend to be older, and they have had more time to participate in legislating. But somewhere among those factors there’s an interesting correlation to having an extreme political ideology.
These leadership and ideology scores give us a view into Congress that is normally hidden to us. We can’t observe leadership. We’re not there, in Congress, to see it. We’re not in the meetings where you can see relationships form. But those relationships are known to the congressmen and senators. It’s obvious to them. They know whether they lead or follow. Their staff know. This is a sort of social knowledge that is locked within the institution of Congress, unless we get a little creative with how we try to observe it.
Overview
The data that goes into this analysis is a list of who sponsored or cosponsored which bills. The process doesn’t look at the content of the bills or anything else about the Members of Congress, but it is able to infer underlying behavioral patterns, some of which correspond to real-world concepts like leadership.
We first began publishing leadership scores in 2010. As far as we know, this analysis is unique to GovTrack.
Methodology
The inspiration for this analysis comes from Google’s PageRank algorithm, which governs how Google ranks the order of pages in its search results. Google’s method is widely known: the more links you get to your website from other websites, and the more links those other websites have, the higher your PageRank and the higher up in search results you appear.
Here’s how we apply it to Congress: the more Members of Congress that cosponsor Member X’s bills, and the more cosponsors those other Members of Congress have, the higher X’s leadership score.
We start by forming a matrix (a grid of numbers) with cosponsorship data. It is the same matrix as in the ideology analysis, so see the methodology section there for details. Then we run the PageRank algorithm on the matrix, which yields a new number for each Member of Congress. That is the leadership score.
This analysis came from a suggestion from Joseph Barillari (who GovTrack’s creator knew in college). (The original formulation of the score for Member of Congress X was the mean across all other Members of Congress Y of the log of the number of bills sponsored by X and cosponsored by Y divided by the number of bills sponsored by Y and cosponsored by X.)
Data
The leadership scores can be found in two CSV files sponsorshipanalysis_h.txt and sponsorshipanalysis_s.txt (House and Senate) over here.
Source Code
Here is pseudo-code in Python. Assuming you have the cosponsorship matrix in P:
x = numpy.ones( (N, 1) ) / float(N)
while True:
y = numpy.dot(P, x)
if onenorm(y-x) < .00000000001: break
x = y
def onenorm(u): return sum(abs(u))
The full source code for this analysis can be found on github.
Citation
To cite our methodology and results, we recommend either of these:
GovTrack.us. 2013. Leadership Analysis of Members of Congress. Accessed at http://www.govtrack.us/about/analysis.
Tauberer, Joshua. 2012. Observing the Unobservables in the U.S. Congress, presented at Law Via the Internet 2012, Cornell Law School, October 2012. [text | slides | video]
References
Kamvar, Sep. 2010. Numerical algorithms for personalized search in self-organizing information networks. Princeton University Press.
Bill Prognosis Analysis
GovTrack computes a prognosis for each bill, which is the probability that the bill will be enacted. Our computation is based on factors that are correlated with successful or failed bills in the past, such as whether the sponsor is a committee chair.
What is the point of this?
- More than 10,000 bills will be considered by each Congress. About 4% will become law. Which bills should we focus on?
- Congressmen and senators, their staff, and lobbyists all know what bills are important because they have the institutional knowledge of what makes a bill important. The prognosis highlights the factors that make a bill successful.
The prognosis scores can be found on the pages for bills throughout the site.
Overview
The data that goes into this analysis are factors that we compute for bills, such as whether the sponsor is a committee chair (see right for a full list). We “train” the model on bills from the 112th Congress (2011-2013) using the factors and whether a) the bill made it out of committee and b) was enacted, and we use that to compute probabilities for bills in the current Congress.
We first began publishing leadership scores in 2012. As far as we know, we were the first to apply this analysis to Congressional bills.
Methodology
This analysis is based on a logistic regression. Logistic regression is similar to simple linear regression but it is more appropriate when modeling probabilities.
The model is trained on bills from the 112th Congress (2011-2013). The explanatory variables are the binary factors mentioned above and listed at right. We create sixteen separate models. For each chamber (House and Senate), and for each of the four types of bills and resolutions in that chamber, we compute one model that predicts whether a bill will get out of committee and a separate model that computes, for bills out of committee, whether the bill will be enacted (or for resolutions whether it will be agreed to).
The output of the logistic regression models are weights assigned to the factors, called β in the table at the right. The prognosis score for a bill is computed by multiplying all of the factors together that apply to the bill (more or less, see logistic regression on Wikipedia for details).
In choosing the factors for model, we select from a large set of plausible factors those which appear to be statistically significant on their own (using a binomial distribution). After the logistic regression, we remove factors that appear statistically non-significant and re-compute the model.
Citation
To cite our methodology and results, we recommend either of these:
GovTrack.us. 2013. Bill Prognosis Analysis. Accessed at http://www.govtrack.us/about/analysis.
Tauberer, Joshua. 2012. Observing the Unobservables in the U.S. Congress, presented at Law Via the Internet 2012, Cornell Law School, October 2012. [text | slides | video]
References
Here is some academic work on the same subject:
Tae Yano, Noah A. Smith, and John D. Wilkerson. "Textual Predictors of Bill Survival in Congressional Committees," at New Directions in Analyzing Text as Data 2012, 5-6 October at Harvard.
Results
The following tables show how various factors help or hurt a bill or resolution’s chance of making it out of committee and getting enacted (or agreed to). Two tables are given for each of the eight bill types.
In the tables, N is the number of bills/resolutions that had the indicated factor in the training corpus; %S is of bills with this factor, the percent that were successful (past committee or enacted); and β is the regression coefficient (weight) from the prognosis analysis. Higher weights increase the bill or resolution’s probability of success.
House of Representatives bills (H.R.) — sent out of committee to the floor
Overall, 11% of the 6,723 House of Representatives bills in 2011-2013 were sent out of committee to the floor. The following factors help or hurt that:
| N | %S | β | Factor |
|---|---|---|---|
| 71 | 69% | 3.3 | Title starts with "To designate the facility of the United States Postal". |
| 197 | 74% | 2.6 | Sponsor is a relevant committee chairman. |
| 247 | 61% | 2.1 | A cosponsor is a relevant committee chairman. |
| 34 | 26% | 1.6 | On a companion bill: Sponsor is a relevant committee chairman. |
| 112 | 24% | 1.3 | Referred to House Appropriations. |
| 134 | 29% | 1.1 | Referred to House Homeland Security. |
| 649 | 24% | 1.1 | Referred to House Natural Resources. |
| 44 | 23% | 1.0 | On a companion bill: Sponsor is in majority party and 1/3rd+ of cosponsors are in minority party. |
| 166 | 27% | 1.0 | Got past committee in a previous Congress. |
| 1,522 | 29% | 1.0 | Sponsor is on a relevant committee & in majority party. |
| 47 | 34% | 0.9 | On a companion bill: A cosponsor is a relevant committee ranking member. |
| 425 | 25% | 0.8 | A cosponsor is a relevant committee ranking member. |
| 80 | 23% | 0.7 | Referred to House Small Business. |
| 1,899 | 19% | 0.7 | Has cosponsors from both parties. |
| 160 | 26% | 0.6 | Referred to House Budget. |
| 631 | 23% | 0.3 | Cosponsor has high leadership score (majority party). |
| 727 | 15% | -0.4 | 3-5 cosponsors are on a relevant committee. |
| 1,012 | 8% | -0.5 | Referred to House Energy and Commerce. |
| 3,197 | 4% | -0.5 | Sponsor is a member of the minority party. |
| 793 | 8% | -0.5 | Cosponsor has high leadership score (minority party). |
| 264 | 15% | -0.6 | Referred to House Foreign Affairs. |
| 1,378 | 4% | -0.7 | Is a bill reintroduced from a previous Congress. |
| 321 | 5% | -1.1 | Referred to House Armed Services. |
| 552 | 4% | -1.2 | Referred to House Education and the Workforce. |
| 2,530 | 3% | -1.5 | Referred to House Ways and Means. |
| 41 | 0% | -29.9 | Title starts with "To extend and modify the temporary". |
| 35 | 0% | -30.5 | Title starts with "To suspend temporarily the duty on mixtures". |
| 37 | 0% | -30.5 | Title starts with "To extend the temporary reduction of duty on". |
| 31 | 0% | -30.5 | Title starts with "To suspend temporarily the rate of duty on certain". |
| 41 | 0% | -30.9 | Title starts with "To reduce temporarily the duty on". |
| 58 | 0% | -31.0 | Title starts with "To extend the temporary suspension of duty on certain". |
| 137 | 0% | -31.6 | Title starts with "To suspend temporarily the duty on certain". |
| 81 | 0% | -31.7 | Title starts with "To extend the suspension of duty on". |
Senate bills (S.) — sent out of committee to the floor
Overall, 12% of the 3,716 Senate bills in 2011-2013 were sent out of committee to the floor. The following factors help or hurt that:
| N | %S | β | Factor |
|---|---|---|---|
| 121 | 61% | 1.7 | Got past committee in a previous Congress. |
| 25 | 52% | 1.7 | Title starts with "A bill to designate". |
| 227 | 46% | 1.6 | Sponsor is a relevant committee chairman. |
| 23 | 30% | 1.0 | Title starts with "A bill to authorize". |
| 169 | 37% | 1.0 | A cosponsor is a relevant committee chairman. |
| 929 | 20% | 0.7 | Has cosponsors from both parties. |
| 216 | 19% | 0.7 | Has a companion bill sponsored by the other party. |
| 1,249 | 17% | 0.5 | Sponsor is on a relevant committee & in majority party. |
| 101 | 28% | 0.5 | On a companion bill: Referred to House Natural Resources. |
| 724 | 7% | -0.7 | Is a bill reintroduced from a previous Congress. |
| 193 | 22% | -1.3 | Referred to Senate Homeland Security and Governmental Affairs. |
| 290 | 22% | -1.3 | Referred to Senate Energy and Natural Resources. |
| 203 | 20% | -1.5 | Referred to Senate Environment and Public Works. |
| 74 | 22% | -1.5 | Referred to Senate Foreign Relations. |
| 325 | 19% | -1.6 | Referred to Senate Judiciary. |
| 187 | 17% | -1.8 | Referred to Senate Commerce, Science, and Transportation. |
| 103 | 6% | -2.7 | Referred to Senate Armed Services. |
| 172 | 5% | -2.8 | Referred to Senate Banking, Housing, and Urban Affairs. |
| 74 | 4% | -3.1 | Referred to Senate Agriculture, Nutrition, and Forestry. |
| 383 | 3% | -3.5 | Referred to Senate Health, Education, Labor, and Pensions. |
| 1,349 | 1% | -4.1 | Referred to Senate Finance. |
| 28 | 0% | -29.5 | Title starts with "A bill to extend and modify the temporary reduction". |
| 25 | 0% | -29.5 | Title starts with "A bill to renew the temporary suspension of duty". |
| 34 | 0% | -29.7 | Title starts with "A bill to reduce temporarily the duty on". |
| 80 | 0% | -30.4 | Title starts with "A bill to suspend temporarily the duty on certain". |
| 231 | 0% | -31.4 | Title starts with "A bill to extend the temporary suspension of duty". |
| 27 | 0% | -33.5 | Referred to Senate Budget. |
House of Representatives bills (H.R.) — enacted
Overall, 28% of the 710 House of Representatives bills that got past committee in 2011-2013 were enacted. The following factors help or hurt that:
| N | %S | β | Factor |
|---|---|---|---|
| 49 | 84% | 3.2 | Title starts with "To designate the facility of the United States Postal". |
| 41 | 44% | 1.5 | Referred to House Budget. |
| 37 | 68% | 1.4 | Has a companion bill sponsored by the other party. |
| 18 | 50% | 1.2 | Introduced in the last 90 days of the Congress (incl. companion bills). |
| 26 | 69% | 1.1 | On a companion bill: Sponsor is on a relevant committee & in majority party. |
| 108 | 41% | 0.8 | A cosponsor is a relevant committee ranking member. |
| 72 | 42% | 0.7 | Referred to House Ways and Means. |
| 105 | 49% | 0.5 | Sponsor is in majority party and 1/3rd+ of cosponsors are in minority party. |
| 17 | 6% | -2.8 | Referred to House Rules. |
Senate bills (S.) — enacted
Overall, 16% of the 449 Senate bills that got past committee in 2011-2013 were enacted. The following factors help or hurt that:
| N | %S | β | Factor |
|---|---|---|---|
| 42 | 40% | 1.3 | Referred to Senate Homeland Security and Governmental Affairs. |
| 215 | 10% | -0.8 | Sponsor is on a relevant committee & in majority party. |
| 17 | 0% | -34.3 | On a companion bill: A cosponsor is a relevant committee ranking member. |
House simple resolutions (H.Res.) — sent out of committee to the floor
Overall, 27% of the 845 House simple resolutions in 2011-2013 were sent out of committee to the floor. The following factors help or hurt that:
| N | %S | β | Factor |
|---|---|---|---|
| 41 | 10% | 36.8 | On a companion bill: Has cosponsors from both parties. |
| 121 | 99% | 5.0 | Title starts with "Providing for consideration of". |
| 21 | 48% | 4.5 | A cosponsor is a relevant committee chairman. |
| 34 | 9% | 4.3 | On a companion bill: Sponsor is a member of the minority party. |
| 33 | 48% | 3.2 | Got past committee in a previous Congress. |
| 33 | 82% | 1.8 | Sponsor is a relevant committee chairman. |
| 105 | 10% | 1.1 | Cosponsor has high leadership score (minority party). |
| 168 | 80% | -0.6 | Referred to House Rules. |
| 39 | 10% | -1.3 | Referred to House Armed Services. |
| 253 | 8% | -1.4 | Has cosponsors from both parties. |
| 84 | 2% | -1.6 | Title starts with "Expressing the sense of the House of Representatives that". |
| 57 | 7% | -1.9 | Referred to House Judiciary. |
| 402 | 8% | -1.9 | Sponsor is a member of the minority party. |
| 173 | 10% | -2.0 | Referred to House Foreign Affairs. |
| 31 | 3% | -2.0 | Referred to House Ways and Means. |
| 90 | 2% | -2.5 | Referred to House Oversight and Government Reform. |
| 88 | 1% | -3.0 | Referred to House Energy and Commerce. |
| 82 | 1% | -4.9 | Referred to House Education and the Workforce. |
| 25 | 0% | -34.3 | On a companion bill: 3-5 cosponsors are on a relevant committee. |
| 25 | 0% | -34.8 | Title starts with "Supporting the goals and ideals of National". |
| 64 | 0% | -35.1 | Is a bill reintroduced from a previous Congress. |
| 22 | 0% | -35.1 | Referred to House Natural Resources. |
| 51 | 0% | -35.8 | Title starts with "Expressing support for designation of". |
| 82 | 5% | -36.5 | Has a companion bill in the other chamber. |
| 15 | 0% | -36.9 | Referred to House Science, Space, and Technology. |
Senate simple resolutions (S.Res.) — sent out of committee to the floor
Overall, 73% of the 630 Senate simple resolutions in 2011-2013 were sent out of committee to the floor. The following factors help or hurt that:
| N | %S | β | Factor |
|---|---|---|---|
| 25 | 96% | 2.5 | Got past committee in a previous Congress. |
| 71 | 92% | 1.6 | Sponsor is in majority party and 1/3rd+ of cosponsors are in minority party. |
| 37 | 86% | 1.1 | Title starts with "A resolution congratulating the". |
| 319 | 83% | 1.0 | Has cosponsors from both parties. |
| 113 | 82% | 0.7 | Cosponsor has high leadership score (majority party). |
| 26 | 15% | -2.2 | Is a bill reintroduced from a previous Congress. |
| 84 | 45% | -3.3 | Referred to Senate Judiciary. |
| 77 | 44% | -3.5 | Referred to Senate Foreign Relations. |
| 31 | 42% | -3.5 | Referred to Senate Health, Education, Labor, and Pensions. |
| 31 | 19% | -3.7 | Title starts with "A resolution expressing the sense of the Senate that". |
| 34 | 12% | -4.2 | Referred to Senate Rules and Administration. |
Senate simple resolutions (S.Res.) — agreed to
Overall, 98% of the 462 Senate simple resolutions that got past committee in 2011-2013 were agreed to. The following factors help or hurt that:
| N | %S | β | Factor |
|---|---|---|---|
| 34 | 76% | -4.9 | Referred to Senate Foreign Relations. |
House simple resolutions (H.Res.) — agreed to
Overall, 96% of the 228 House simple resolutions that got past committee in 2011-2013 were agreed to. The following factors help or hurt that:
There were no statistically significant factors in the model.
House concurrent resolutions (H.Con.Res.) — sent out of committee to the floor
Overall, 29% of the 147 House concurrent resolutions in 2011-2013 were sent out of committee to the floor. The following factors help or hurt that:
| N | %S | β | Factor |
|---|---|---|---|
| 20 | 5% | 32.3 | On a companion bill: 6+ cosponsors are on a relevant committee. |
| 35 | 9% | -1.4 | Referred to House Foreign Affairs. |
| 19 | 5% | -2.4 | Is a bill reintroduced from a previous Congress. |
| 27 | 4% | -34.6 | Has a companion bill in the other chamber. |
| 37 | 0% | -37.4 | Title starts with "Expressing the sense of Congress that". |
House joint resolutions (H.J.Res.) — sent out of committee to the floor
Overall, 12% of the 122 House joint resolutions in 2011-2013 were sent out of committee to the floor. The following factors help or hurt that:
| N | %S | β | Factor |
|---|---|---|---|
| 17 | 53% | 3.0 | Sponsor is on a relevant committee & in majority party. |
| 62 | 0% | -38.1 | Title starts with "Proposing an amendment to the Constitution of the United". |
Senate concurrent resolutions (S.Con.Res.) — sent out of committee to the floor
Overall, 65% of the 65 Senate concurrent resolutions in 2011-2013 were sent out of committee to the floor. The following factors help or hurt that:
| N | %S | β | Factor |
|---|---|---|---|
| 19 | 42% | 33.0 | On a companion bill: 6+ cosponsors are on a relevant committee. |
| 20 | 40% | -34.5 | Has a companion bill in the other chamber. |
Senate joint resolutions (S.J.Res.) — sent out of committee to the floor
Overall, 41% of the 51 Senate joint resolutions in 2011-2013 were sent out of committee to the floor. The following factors help or hurt that:
| N | %S | β | Factor |
|---|---|---|---|
| 21 | 19% | -1.7 | Referred to Senate Judiciary. |
House concurrent resolutions (H.Con.Res.) — agreed to
Overall, 70% of the 43 House concurrent resolutions that got past committee in 2011-2013 were agreed to. The following factors help or hurt that:
There were no statistically significant factors in the model.
Senate concurrent resolutions (S.Con.Res.) — agreed to
Overall, 48% of the 42 Senate concurrent resolutions that got past committee in 2011-2013 were agreed to. The following factors help or hurt that:
There were no statistically significant factors in the model.
Senate joint resolutions (S.J.Res.) — enacted or passed
Overall, 29% of the 21 Senate joint resolutions that got past committee in 2011-2013 were enacted or passed. The following factors help or hurt that:
There were no statistically significant factors in the model.
House joint resolutions (H.J.Res.) — enacted or passed
Overall, 40% of the 15 House joint resolutions that got past committee in 2011-2013 were enacted or passed. The following factors help or hurt that:
There were no statistically significant factors in the model.
Did it work? The following charts compare the prognoses computed for bills to their actual rate of success. The prognosis model for these charts was trained on the 111th Congress and tested on the 112th Congress.
For each regression model, the bills are divided into 10 bins by prognosis. The median prognosis is plotted on the horizontal axis and the percentage of successful bills in the bin is plotted on the vertical axis.
You can see that the prognosis overestimated the actual chances of success, but the rough upward slope in most of the charts shows that the prognosis was often predictive of a bill’s future.
House of Representatives bills (H.R.) — sent out of committee to the floor
Senate bills (S.) — sent out of committee to the floor
House of Representatives bills (H.R.) — enacted
Senate bills (S.) — enacted
House simple resolutions (H.Res.) — sent out of committee to the floor
Senate simple resolutions (S.Res.) — sent out of committee to the floor
Senate simple resolutions (S.Res.) — agreed to
House simple resolutions (H.Res.) — agreed to
House concurrent resolutions (H.Con.Res.) — sent out of committee to the floor
House joint resolutions (H.J.Res.) — sent out of committee to the floor
House concurrent resolutions (H.Con.Res.) — agreed to
Senate concurrent resolutions (S.Con.Res.) — agreed to
Here are some additional charts for machine learning researchers.
The charts below show precision vs. recall plotted parametrically for various values of a success-fail threshold t. Bills with prognosis above t are predicted successes for the purposes of these charts. The prognosis model for these charts was trained on the 111th Congress and tested on the 112th Congress.