Summaries of GPI's working papers

Below you can find summaries of some working papers written by GPI researchers. The full text of these papers, as well as other working papers, can be found on our papers page.

	
			
	

	
	
		
	
		

			
				Summary: Will AI avoid exploitation (Adam Bales)
			

			
			
			
			
		

	
	
	
		
				
			We might hope that there is a straightforward way of predicting the behaviour of future artificial intelligence (AI) systems. Some have suggested that AI systems will maximise expected utility, because anything else would allow them to accept a series of trades that result in a guaranteed loss of something valuable (Omohundro, 2008). Indeed, we would be able to predict AI behaviour if…
					
		
		
	

	



	
	
		
	
		

			
				Summary: The scope of longtermism (David Thorstad)
			

			
			
			
			
		

	
	
	
		
				
			Recent work argues for longtermism – the position that often our morally best options will be those with the best long-term consequences. Proponents of longtermism sometimes suggest that in most decisions expected long-term benefits outweigh all short-term effects. In ‘The scope of longtermism’, David Thorstad argues that most of our decisions do not have this character. He identifies three features…
					
		
		
	

	



	
	
		
	
		

			
				Summary: Simulation expectation (Teruji Thomas)
			

			
			
			
			
		

	
	
	
		
				
			At some point in the future we may invent sophisticated simulations. If we do so, we could run millions of simulations of minor variants of the 21st century, each inhabited by simulated people. To those simulated people, it will appear as if they really lived in the 21st century. But that is exactly how our world appears to us, and perhaps we live in a simulation. …
					
		
		
	

	



	
	
		
	
		

			
				Summary: High risk, low reward: A challenge to the astronomical value of existential risk mitigation (David Thorstad)
			

			
			
			
			
		

	
	
	
		
				
			The value of the future may be vast. Human extinction, which would destroy that potential, would be extremely bad. Some argue that making such a catastrophe just a little less likely would be by far the best use of our limited resources – much more important than, for example, tackling poverty, inequality, global health or racial injustice. In “High risk, low reward: A challenge to the astronomical…
					
		
		
	

	



	
	
		
	
		

			
				Summary: When should an effective altruist donate? (William MacAskill)
			

			
			
			
			
		

	
	
	
		
				
			Effective altruists seek to do as much good as possible given limited resources, often by donating to important causes like global health and poverty, farmed animal welfare, and reducing existential risks. Can we help more by donating now or later? This is the thorny question William MacAskill tackles in the paper “When should an effective altruist donate?”. He explores several considerations…
					
		
		
	

	



	
	
		
	
		

			
				Summary: Are we living at the hinge of history? (William MacAskill)
			

			
			
			
			
		

	
	
	
		
				
			Longtermist altruists – who care about how much impact they have, but not about when that impact occurs – have a strong reason to invest resources before using them directly. Invested resources could grow much larger and be used to do much more good in the future. For example, a $1 investment that grows 5% per year would become $17,000 in 200 years. …
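The growth figure in this summary can be checked with a short calculation. The sketch below assumes the 5% return compounds once per year, which the summary does not state explicitly; the function name `compound_value` is illustrative and does not come from the paper.

```python
def compound_value(principal: float, annual_rate: float, years: int) -> float:
    """Value of `principal` after `years` of growth at `annual_rate`.

    Assumes the return is compounded once per year (an assumption of this sketch).
    """
    return principal * (1 + annual_rate) ** years


# $1 invested at 5% per year for 200 years
value = compound_value(1.0, 0.05, 200)
print(f"${value:,.0f}")
```

Under that assumption the result is a little over $17,000, in line with the rounded figure quoted in the summary.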
					
		
		
	

	



	
	
		
	
		

			
				Summary: Longtermist institutional reform (Tyler M. John and William MacAskill)
			

			
			
			
			
		

	
	
	
		
				
			Political decisions can have lasting effects on the lives and wellbeing of future generations. Yet political institutions tend to make short-term decisions with only the current generation – or even just the current election cycle – in mind. In “Longtermist institutional reform”, Tyler M. John and William MacAskill identify the causes of short-termism in government and give four recommendations…
					
		
		
	

	



	
	
		
	
		

			
				Summary: Do not go gentle: why the Asymmetry does not support anti-natalism (Andreas Mogensen)
			

			
			
			
			
		

	
	
	
		
				
			Many people believe that it makes the world worse to create miserable lives, but that it doesn’t make the world better to create happy lives. This is one way of expressing “the Asymmetry” in population ethics. If we go on creating new people, many will be happy, but some will be unhappy. If we accept the Asymmetry, the continued existence of humanity therefore involves…
					
		
		
	

	



	
	
		
	
		

			
				Summary: The paralysis argument (William MacAskill and Andreas Mogensen)
			

			
			
			
			
		

	
	
	
		
				
			For consequentialists, the outcomes that follow from our actions fully determine the moral value of our actions. Actions are right to the extent they bring about good outcomes and wrong to the extent they bring about bad outcomes. If, as many philosophers believe (Greaves & MacAskill, 2021), the best outcomes we can bring about involve improving the long-run future for sentient life…
					
		
		
	

	



	
	
		
	
		

			
				Summary: The Epistemic Challenge to Longtermism (Christian Tarsney)
			

			
			
			
			
		

	
	
	
		
				
			According to longtermism, what we should do mainly depends on how our actions might affect the long-term future. This claim faces a challenge: the course of the long-term future is difficult to predict, and the effects of our actions on the long-term future might be so unpredictable as to make longtermism false. …
					
		
		
	

	



	
	
		
	
		

			
				Summary: A Paradox for Tiny Probabilities and Enormous Values (Nick Beckstead and Teruji Thomas)
			

			
			
			
			
		

	
	
	
		
				
			Many decisions in life involve balancing risks against their potential payoffs. Sometimes, the risks are small: you might be killed by a car while walking to the shops, but it would be unreasonably timid to sit at home and run out of toilet paper in order to avoid this risk. Other times, the risks are overwhelmingly large: your lottery ticket might win tomorrow, but it would be reckless to borrow £20,000 from a loan shark…
					
		
		
	

	



	
	
		
	
		

			
				Summary: Staking our future: deontic long-termism and the non-identity problem (Andreas Mogensen)
			

			
			
			
			
		

	
	
	
		
				
			In “The case for strong longtermism”, Greaves and MacAskill (2021) argue that potential far-future effects are the most important determinant of the value of our options. This is “axiological strong longtermism”. On some views, we can achieve astronomical value by making the future population of worthwhile lives much greater than it would otherwise have been…
					
		
		
	

	



	
	
		
	
		

			
				Summary: Moral demands and the far future (Andreas Mogensen)
			

			
			
			
			
		

	
	
	
		
				
			Consequentialism is the view that good and right coincide: right actions are those which maximise good and minimise bad. The best-known form of consequentialism is utilitarianism. By inviting morality to override all else in our lives, utilitarianism inspires what is known as the demandingness objection: that utilitarianism asks far too much of us and so is unacceptable as a moral theory. …
					
		
		
	

	



	
	
		
	
		

			
				Summary: The Case for Strong Longtermism (Hilary Greaves and William MacAskill)
			

			
			
			
			
		

	
	
	
		
				
			In this paper, Greaves and MacAskill make the case for strong longtermism: the view that the most important feature of our actions today is their impact on the far future. They claim that strong longtermism is of the utmost significance: that if the view were widely adopted, much of what we prioritise would change. …
					
		
		
	

	



	
	
		
	
		

			
				Summary: Doomsday rings twice (Andreas Mogensen)
			

			
			
			
			
		

	
	
	
		
				
			We live at a time of rapid technological development, and how we handle the most powerful emerging technologies could determine the fate of humanity. Indeed, our ability to prevent such technologies from causing extinction could put us amongst the most important people in history. In “Doomsday rings twice”, Andreas Mogensen illustrates how our potential importance could be evidence…