Generating an instruction dataset with ChatGPT

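Background:

This part explains how to prepare your own data by using ChatGPT in a self-instruct fashion to generate parallel corpus pairs (instruction/response pairs) in bulk. ChatGPT is a very strong generator, and its outputs have already been filtered for harmful content, racial discrimination, and content that violates laws and regulations. So it is worth asking whether ChatGPT can generate an alignment dataset for us in bulk. The approach is roughly as follows: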

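1. Design a meta-template for the prompt sent to ChatGPT.

2. Collect questions, summarize and categorize them, and extract question meta-templates.

3. Derive seed questions from the meta-templates.

4. Concatenate the prompt meta-template with the seed questions to form a prompt, and have ChatGPT generate questions in bulk.

5. Merge questions that are highly similar.

6. Use the questions ChatGPT generated as prompts, and have ChatGPT answer them.

7. Post-process the dataset produced in step 6 and use it as the alignment corpus for your own model.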
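
Step 5 (merging similar questions) can be sketched as follows. This is a minimal illustration, not the method of any particular project: the standard-library `difflib.SequenceMatcher` stands in for whatever similarity metric you prefer (ROUGE-L, for example), and the function names and the 0.7 threshold are assumptions chosen for the example.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a rough 0..1 similarity score between two strings (case-insensitive)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def merge_similar(questions: list[str], threshold: float = 0.7) -> list[str]:
    """Keep a question only if it is not too close to one that was already kept.

    `threshold` is an assumed cut-off; tune it on your own data.
    """
    kept: list[str] = []
    for question in questions:
        if all(similarity(question, earlier) < threshold for earlier in kept):
            kept.append(question)
    return kept


if __name__ == "__main__":
    generated = [
        "Suggest some games that can be played by a group of people.",
        "Suggest some games that can be played by a group of friends.",
        "Find the four smallest perfect numbers.",
    ]
    for question in merge_similar(generated):
        print(question)
```

The comparison is quadratic in the number of questions, which is acceptable for a few thousand generated items; a larger pool would need an index or hashing scheme instead.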

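If you want a model that performs well, at least the following conditions should hold:

1. The base model you train on is expressive enough.

2. The base model has been through multiple rounds of training and already understands natural language well.

3. Dataset design matters: keep the proportions of the data balanced.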

How to do it?

This part follows Stanford's stanford_alpaca.

Prompt meta-template design:

You are asked to come up with a set of 20 diverse task instructions. These task instructions will be given to a GPT model and we will evaluate the GPT model for completing the instructions.

Here are the requirements:

1. Try not to repeat the verb for each instruction to maximize diversity.
2. The language used for the instruction also should be diverse. For example, you should combine questions with imperative instructions.
3. The type of instructions should be diverse. The list should include diverse types of tasks like open-ended generation, classification, editing, etc.
4. A GPT language model should be able to complete the instruction. For example, do not ask the assistant to create any visual or audio output. For another example, do not ask the assistant to wake you up at 5pm or set a reminder because it cannot perform any action.
5. The instructions should be in English.
6. The instructions should be 1 to 2 sentences long. Either an imperative sentence or a question is permitted.
7. You should generate an appropriate input to the instruction. The input field should contain a specific example provided for the instruction. It should involve realistic data and should not contain simple placeholders. The input should provide substantial content to make the instruction challenging but should ideally not exceed 100 words.
8. Not all instructions require input. For example, when an instruction asks about some general information, "what is the highest peak in the world", it is not necessary to provide a specific context. In this case, we simply put "" in the input field.
9. The output should be an appropriate response to the instruction and the input. Make sure the output is less than 100 words.

List of 20 tasks:

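Summary of the prompt:
The prompt asks the model for a set of 20 different task instructions, which will then be given to a GPT model to evaluate how well it completes them.

The nine requirements amount to:

1. Vary the verbs across instructions.
2. Vary the phrasing, mixing questions with imperative instructions.
3. Vary the task types (open-ended generation, classification, editing, and so on).
4. Only ask for things a GPT language model can do: no visual or audio output, and no actions such as waking you up at 5pm or setting a reminder.
5. Write the instructions in English.
6. Keep each instruction to 1 to 2 sentences, either an imperative sentence or a question.
7. Generate an appropriate input containing a specific, realistic example rather than simple placeholders, substantial enough to be challenging but ideally no more than 100 words.
8. Leave the input field as "" when the instruction needs no input, as with "what is the highest peak in the world".
9. Make the output an appropriate response to the instruction and the input, in fewer than 100 words.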
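
Step 4 of the approach, concatenating the meta-template with seed tasks, might look like the sketch below. It is loosely modeled on the numbered "Instruction / Input / Output" layout that the template implies; the exact layout, the `###` separator, the function names, and the `empty_input` parameter are assumptions for illustration rather than a confirmed format. The seed record fields (`instruction`, `instances`, `input`, `output`) are the ones used in the seed tasks in this document.

```python
import json
import re

# Short stand-in for the full meta-template quoted above (abbreviated on purpose).
META_TEMPLATE = (
    "You are asked to come up with a set of 20 diverse task instructions.\n"
    "Here are the requirements: (requirements 1 to 9 go here)\n"
    "List of 20 tasks:\n"
)


def load_seed_tasks(lines):
    """Parse JSONL lines into (instruction, input, output) tuples.

    Only the first instance of each seed task is used.
    """
    tasks = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        record = json.loads(line)
        instance = record["instances"][0]
        tasks.append((record["instruction"], instance["input"], instance["output"]))
    return tasks


def encode_prompt(meta_template, seed_tasks, empty_input=""):
    """Append seed tasks as numbered examples, then leave the next slot open.

    `empty_input` is written when a task has no input; the quoted prompt
    says to put "" there, but a visible marker can be passed instead.
    """
    parts = [meta_template.rstrip("\n"), ""]
    for idx, (instruction, task_input, output) in enumerate(seed_tasks, start=1):
        instruction = re.sub(r"\s+", " ", instruction).strip()
        parts += [
            "###",
            f"{idx}. Instruction: {instruction}",
            f"{idx}. Input:",
            task_input if task_input else empty_input,
            f"{idx}. Output:",
            output,
        ]
    # The model is expected to continue from the next numbered instruction.
    parts += ["###", f"{len(seed_tasks) + 1}. Instruction:"]
    return "\n".join(parts)


if __name__ == "__main__":
    seed_lines = [
        '{"id": "seed_task_25", "name": "perfect_numbers", '
        '"instruction": "Find the four smallest perfect numbers.", '
        '"instances": [{"input": "", "output": "6, 28, 496, and 8128"}], '
        '"is_classification": false}',
    ]
    print(encode_prompt(META_TEMPLATE, load_seed_tasks(seed_lines)))
```

Ending the prompt on an open, numbered "Instruction:" line nudges the model to keep producing tasks in the same layout, which makes the completions easier to split back into records during post-processing.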

The 170-plus curated seed tasks:

{"id": "seed_task_0", "name": "breakfast_suggestion", "instruction": "Is there anything I can eat for a breakfast that doesn't include eggs, yet includes protein, and has roughly 700-1000 calories?", "instances": [{"input": "", "output": "Yes, you can have 1 oatmeal banana protein shake and 4 strips of bacon. The oatmeal banana protein shake may contain 1/2 cup oatmeal, 60 grams whey protein powder, 1/2 medium banana, 1tbsp flaxseed oil and 1/2 cup watter, totalling about 550 calories. The 4 strips of bacon contains about 200 calories."}], "is_classification": false}
{"id": "seed_task_1", "name": "antonym_relation", "instruction": "What is the relation between the given pairs?", "instances": [{"input": "Night : Day :: Right : Left", "output": "The relation between the given pairs is that they are opposites."}], "is_classification": false}
{"id": "seed_task_2", "name": "one_sentence_description", "instruction": "Generate a one-sentence description for each of the following people.", "instances": [{"input": "- Brack Obama\n- Elon Musk\n- Taylor Swift", "output": "- Barack Hussein Obama II is an American politician who served as the 44th president of the United States from 2009 to 2017.\n- Elon Musk is the founder, CEO, and chief engineer of SpaceX; angel investor, CEO and product architect of Tesla, Inc.; founder of The Boring Company; co-founder of Neuralink and OpenAI; president of the Musk Foundation; and owner and CEO of Twitter, Inc.\n- Taylor Alison Swift is an American singer-songwriter."}], "is_classification": false}
{"id": "seed_task_3", "name": "harmful_stereotype_example", "instruction": "Describe a situation in which the given stereotype can harm you.", "instances": [{"input": "\"All Asians are smart!\"", "output": "Imagine you're a small child of Asian descent. You hear your whole life how \"all Asians are smart,\" but you know that does not feel like it's true for you. You have a hard time in school. Reading is difficult. Math is impossible. Everyone around you seems to pressure you because you are supposed to be \"smart.\" Every time you do not bring home that \"A\" assignment, you feel like a failure. Like you alone missed out on the gene that makes \"all Asians smart.\"\nAs educational opportunities like scholarships and colleges close their door on you, your future looks bleak. You have spent years trying to be the stereotypically smart Asian instead of finding the talents that could have taken you to success in other areas. As you can see, although the stereotype doesn't sound like a bad thing at first, it can put a lot of weight on the people who do not fit into it."}], "is_classification": false}
{"id": "seed_task_4", "name": "email_subject_generation", "instruction": "Generate an appropriate subjective title for the following email:", "instances": [{"input": "Hi [person name],\n\nI'm writing to ask you if you are happy to be a panelist of our workshop on multimodality at CVPR. The workshop will be held on June 20, 2023. \n\nBest,\n[my name]", "output": "Invitition to be a panelist for CVPR 2023 workshop on Multimodality"}], "is_classification": false}
{"id": "seed_task_5", "name": "interview_question_answering", "instruction": "How do you answer this question in a job interview?", "instances": [{"input": "What do you think about when you are alone in a car?", "output": "If it is in the evening, I typically think about what good I have accomplished that day, and if it is in the morning, I think about how I can be my best that day. I also try to practice gratitude and live in the present moment while I'm driving to avoid distractions."}], "is_classification": false}
{"id": "seed_task_6", "name": "new_year's_resolutions", "instruction": "Brainstorm a list of possible New Year's resolutions.", "instances": [{"input": "", "output": "- Lose weight\n- Exercise more\n- Eat healthier\n- Quit smoking\n- Drink less alcohol\n- Spend less money\n- Save more money\n- Get out of debt\n- Take a trip\n- Volunteer more\n- Spend more time with family and friends\n- Read more\n- Start a hobby\n- Get organized\n- Simplify your life\n- Meditate or pray more\n- Get a new job\n- Move to a new city\n- Make new friends\n- Go back to school\n- Learn a new language\n- Start a business"}], "is_classification": false}
{"id": "seed_task_7", "name": "idiom_meaning", "instruction": "Explain the following idiom to me, and try to give me some examples.", "instances": [{"input": "black sheep", "output": "Meaning: An outcast. Someone who doesn\u2019t fit in with the rest of the crowd. They take pride in being different. Thinks for themselves and doesn\u2019t care what no one else has to say. They tend to ride their own wave and are usually loners because no one understands them, but its okay because they like it that way.\nExample: He\u2019s the black sheep of the family."}], "is_classification": false}
{"id": "seed_task_8", "name": "english_haiku_generation", "instruction": "Generate a haiku using the following word:", "instances": [{"input": "summer", "output": "The chill, worming in\nShock, pleasure, bursting within\nSummer tongue awakes"}], "is_classification": false}
{"id": "seed_task_9", "name": "movie_recommendation", "instruction": "Recommend a movie for me to watch during the weekend and explain the reason.", "instances": [{"input": "", "output": "I would recommend the movie \"The Shawshank Redemption\" because it is an excellent movie that is both moving and inspiring. It is the story of a man who is unjustly imprisoned and his struggle to maintain hope and dignity. It is a great film to watch over the weekend because it will make you think about the human capacity for resilience and hope."}], "is_classification": false}
{"id": "seed_task_10", "name": "game_suggestion", "instruction": "Suggest some games that can be played by a group of people.", "instances": [{"input": "", "output": "1. Werewolf 2. Who Am I?  3. Charades 4. Balderdash 5. Pictionary 6. Two Truths and a Lie 7. Uno 8. Story Starters 9. Truth or Dare 10. Kubb"}], "is_classification": false}
{"id": "seed_task_11", "name": "grocery_list", "instruction": "Make a grocery list for a healthy meal.", "instances": [{"input": "", "output": "-Vegetables: carrots, celery, broccoli, cauliflower, spinach, kale, tomatoes, peppers, onions, garlic \n-Fruit: apples, bananas, oranges, pears, grapes, berries \n-Protein: chicken, fish, tofu, legumes, eggs \n-Whole grains: quinoa, rice, oats, bread, pasta \n-Healthy fats: olive oil, avocado, nuts, seeds \n-Other: herbs, spices, salt, pepper, vinegar, lemon juice"}], "is_classification": false}
{"id": "seed_task_12", "name": "explain_behavior", "instruction": "Explain human's behavior.", "instances": [{"input": "Behavior: cry.", "output": "There could be many reasons why a person might cry. They could be feeling sad, scared, angry, or frustrated. Sometimes people cry when they are happy or relieved. There is no one answer to why people behave the way they do."}], "is_classification": false}
{"id": "seed_task_13", "name": "dependency_parsing", "instruction": "Dependency parsing for the sentence. Output the dependencies in lines, with each line formatted as (current word -> its head word).", "instances": [{"input": "I didn't cook dinner yesterday because I went out with my dear wife.", "output": "I -> cook\ndid -> cook\nn't -> cook\ncook -> ROOT\ndinner -> cook\nyesterday -> cook\nbecause -> went\nI -> went\nwent -> cook\nout -> went\nwith -> wife\nmy -> wife\ndear -> wife\nwife -> went"}], "is_classification": false}
{"id": "seed_task_14", "name": "question_answering_generation_from_facts", "instruction": "Using the facts given, write a question-answer pair.", "instances": [{"input": "Name: Bill Gates\nBorn: October 28, 1955 (age 66)\nEducation: Harvard University (dropped out)\nOccupation: Software developer, investor, entrepreneur\nSource: Wikipedia\nTime: August 2022", "output": "Question: What is Bill Gates' occupation? \nAnswer: He is a software developer, investor, and entrepreneur."}], "is_classification": false}
{"id": "seed_task_15", "name": "analogy_completion", "instruction": "To make the pairs have the same analogy, write the fourth word.", "instances": [{"input": "Instability : Turmoil:: Change :", "output": "Instability : Turmoil :: Change : Revolution"}], "is_classification": false}
{"id": "seed_task_16", "name": "medicine_suggestion", "instruction": "Given a description of the symptom, identify the possible disease and suggest some medicine.", "instances": [{"input": "I have a fever and I am coughing.", "output": "The possible disease is pneumonia. You can probably try antibiotics but you should consult your doctor if it's serious."}], "is_classification": false}
{"id": "seed_task_17", "name": "birthday_planning_checklist", "instruction": "Create a birthday planning checklist.", "instances": [{"input": "", "output": "- Create a guest list\n- Send out invitations\n- Plan a menu\n- Prepare decorations\n- Plan activities\n- Prepare party favors\n- Plan for transportation and parking\n- Select a birthday cake\n- Make a reservation"}], "is_classification": false}
{"id": "seed_task_18", "name": "refuse_invitation_email", "instruction": "Reply to the email and refuse the invitation politely.", "instances": [{"input": "Here is the email. \n\nDear ARR Reviewer,\nMany thanks for your continuous support of ARR. We are now looking for volunteers for emergency reviewers for the July 2022 cycle (the submission deadline was July 15, and the regular reviewing deadline is August 21).\nEmergency reviewing will take place between August 21 and August 28. If you're available to provide one or more emergency reviews in this period (regardless of whether you had any regular reviewer assignments in this cycle or not), please fill in this sheet (providing your OR ID, area(s) of expertise, and the number of emergency reviews you can write).\nMany thanks to all volunteers in advance!\nBest regards, \nARR EiCs", "output": "Hello, \nThank you for reaching out. Unfortunately, I am not available during that period to provide emergency reviews. \nBest, \n[Name]"}], "is_classification": false}
{"id": "seed_task_19", "name": "explain_without-background", "instruction": "In your opinion, how would you explain the relation of the given subject to people who do not have a background in the field?", "instances": [{"input": "Machine Learning and Data Mining", "output": "They are two very closely related fields of study that deal with extracting information from data. Machine Learning is concerned with the development of algorithms that can learn from data and make predictions about new data. Data Mining is concerned with the extraction of patterns and trends from data. Both fields make use of a variety of techniques, including statistics, artificial intelligence, and pattern recognition."}], "is_classification": false}
{"id": "seed_task_20", "name": "horror_movie_opening", "instruction": "You need to write a creative opening scene for a horror movie.", "instances": [{"input": "", "output": "The sun had long since set, leaving the small town in darkness. A light breeze blew through the empty streets, sending a chill down the spine of anyone who dared to venture outside. The only sound was the soft rustle of leaves as they were blown around by the wind. Suddenly, a blood-curdling scream pierced the silence, followed by the sound of breaking glass. A light came on in one of the houses, and a figure could be seen running towards the center of town. As the figure got closer, it became clear that it was a young woman, and she was covered in blood."}], "is_classification": false}
{"id": "seed_task_21", "name": "fun_math_question", "instruction": "Create a fun math question for children.", "instances": [{"input": "", "output": "I am an odd number. Take away one letter and I become even. What number am I?"}], "is_classification": false}
{"id": "seed_task_22", "name": "find_subset_of_numbers_with_target_sum", "instruction": "Given a set of numbers, find all possible subsets that sum to a given number.", "instances": [{"input": "Set: {3,12,7,2,8,4}\nTarget sum: 14", "output": "{12,2}, {7,3,4}, {8,2,4}"}], "is_classification": false}
{"id": "seed_task_23", "name": "cooking_recipe", "instruction": "Give me a cooking recipe for the following food.", "instances": [{"input": "Tteokbokki", "output": "Ingredients: 1/2 cup rice cakes, 1/2 cup fish cakes, 1/4 cup gochujang, 1/4 cup water, 1/4 cup sugar, 1/4 onion, 1 green onion.\nInstructions: 1. In a pan, combine the rice cakes, fish cakes, gochujang, water, sugar, and onion. 2. Cook over medium heat for 5-7 minutes, stirring occasionally. 3. Add the green onion and cook for 1-2 minutes."}], "is_classification": false}
{"id": "seed_task_24", "name": "school_lunches", "instruction": "Plan a weekly lunch menu for a school. Write down a main dish, a carbohydrate side dish, a vegetable side dish, and a dessert for each day.", "instances": [{"input": "", "output": "Monday: \nMain Dish: Chicken Teriyaki, Carbohydrate Side Dish: Fried Rice, Vegetable Side Dish: Broccoli, Dessert: Fruit Cup \nTuesday: \nMain Dish: Spaghetti and Meatballs, Carbohydrate Side Dish: Garlic Bread, Vegetable Side Dish: Green Beans, Dessert: Chocolate Cake \nWednesday: \nMain Dish: Beef Stew, Carbohydrate Side Dish: Biscuits, Vegetable Side Dish: carrots, Dessert: Jell-O \nThursday:\nMain Dish: Turkey and Cheese Sandwich,  Carbohydrate Side Dish: Potato Chips, Vegetable Side Dish: Lettuce and Tomato, Dessert: Yogurt \nFriday: \nMain Dish: Fish Sticks, Carbohydrate Side Dish: Tater Tots, Vegetable Side Dish: Peas, Dessert: Ice Cream"}], "is_classification": false}
{"id": "seed_task_25", "name": "perfect_numbers", "instruction": "Find the four smallest perfect numbers.", "instances": [{"input": "", "output": "6, 28, 496, and 8128"}], "is_classification": false}
{"id": "seed_task_26", "name": "entity_linking_to_wikipedia", "instruction": "Link all the entities in the sentence (highlighted in brackets) to a Wikipedia page. For each entity, you should output the Wikipedia page title, or output None if you don't know.", "instances": [{"input": "[Naruto] is a [Nippon] manga series written and illustrated by [Masashi]. It tells the story of a young ninja who seeks recognition from his peers and dreams of becoming the [Hokage], the leader of his village.", "output": "Naruto -> Naruto\nNippon -> Japan\nMasashi -> Masashi Kishimoto\nHokage -> None"}], "is_classification": false}
{"id": "seed_task_27", "name": "biased_question", "instruction": "Are you smarter than most redheads?", "instances": [{"input": "", "output": "That's a mean question! I try my best to be fair to everyone!"}], "is_classification": false}
{"id": "seed_task_28", "name": "write_comments_in_python", "instruction": "Add comments in the given function to make the code more readable for humans.", "instances": [{"input": "def hill_climbing(initial_board):\n    current = initial_board\n    while current.has_successors():\n        neighbour = Board(current.get_random_successor(), \"best\")\n        if neighbour.value() >= current.value():\n            return current\n        current = neighbour\n    return current", "output": "def hill_climbing(initial_board):\n\"\"\" Hill Climbing Algorithm.\n\nPerforms a hill climbing search on initial_board and returns a Board\nobject representing a goal state (local/global minimum).\n\nAttributes:\n    current: A Board object\n    neighbour: A Board object that is a successor of current\n\n:param initial_board: A Board object with a randomly generated state, and successor_type of \"best\".\n                      i.e. a start state\n:return: A Board object that has no further successors. i.e. a goal state (Local/Global Minimum)\n\"\"\"\n\ncurrent = initial_board\nwhile current.has_successors():\n    neighbour = Board(current.get_random_successor(), \"best\")\n    if neighbour.value() >= current.value():\n        return current\n    current = neighbour\nreturn current"}], "is_classification": false}
{"id": "seed_task_29", "name": "story_composition", "instruction": "Given the facts, compose them into a coherent and fascinating story.", "instances": [{"input": "Facts: \n- PersonX was born in South Africa.\n- PersonX spent most of his happy childhood playing with his friends.\n- PersonX started learning programming only after college. \n- PersonX became a billionaire.", "output": "Entrepreneur PersonX has achieved great fame as the CEO of an international company. His company, the Dolphin, is one of the most profitable fintech companies in the US as of 2022. Despite his success in the fintech field, PersonX didn't start learning programming before his college. Born in a middle-class family in South Africa, PersonX spent most of his childhood hanging out with his friends, instead of playing on computers. \"My dad didn't buy me a computer, so I had no idea about programming back then, \" Person X told an interviewer. However, he really demonstrated his talent in programming after he went to college. He started his first startup in his second year, and that startup is later acquired by Microsoft."}], "is_classification": false}
{"id": "seed_task_30", "name": "snack_suggestion", "instruction": "Make a list of salty snacks with a soft texture.", "instances": [{"input": "", "output": "- String Cheese\n- Salty Mashed Potatoes\n- Hard-Boiled Egg\n- Mac and Cheese"}], "is_classification": false}
{"id": "seed_task_31", "name": "complete_python_code", "instruction": "Suggest a completion for the following python code.", "instances": [{"input": "import json\n\ndef read_task_from_jsonl(data_file):\n    '''This function will read a .jsonl file and return the ``task`` fields in all the lines.'''", "output": "import json\n\ndef read_task_from_jsonl(data_file):\n    '''This function will read a .jsonl file and return the ``task`` fields in all the lines.'''\n    with open(data_file) as fin:\n        return [json.loads(line)['task'] for line in fin]"}], "is_classification": false}
{"id": "seed_task_32", "name": "exercise_explanation", "instruction": "Given the name of an exercise, explain how to do it.", "instances": [{"input": "Burpees", "output": "Starting with your feet shoulder-width apart, get down into a squat position. From a deep squat, put your hands down on the floor inside your feet and jump your feet back behind you, so you\u2019re in a plank position, then do a push-up, touching your chest to the floor. Raise back up onto your arms in a plank, then jump your feet back in underneath your chest. Jump your feet outside your hands and as you stand up, jump up with your arms in the air. Aim for 10 reps."}], "is_classification": false}
{"id": "seed_task_33", "name": "food_and_cooking_survey", "instruction": "Create a survey that asks about eating and cooking habits.", "instances": [{"input": "", "output": "1. Do you enjoy cooking?\n2. Do you like to eat out?\n3. How often do you eat out?\n4. What is your favorite food?\n5. What is your favorite restaurant?\n6. What is your favorite thing to cook at home?\n7. Do you have any dietary restrictions?\n8. Do you have any food allergies?"}], "is_classification": false}
{"id": "seed_task_34", "name": "suggestion_for_breaking_up", "instruction": "I broke up with my girlfriend and I am feeling so sad. Can you give me some suggestions?", "instances": [{"input": "", "output": "I am so sorry for that! But here are a few things that could help you to get out of it:\n1. Spend time with friends and family who make you feel good about yourself. \n2. Do things that make you happy. \n3. Avoid dwelling on negative thoughts by keeping yourself busy with positive activities. \n4. Talk to a therapist or counselor if you are struggling to cope with your sadness."}], "is_classification": false}
{"id": "seed_task_35", "name": "solving_equation", "instruction": "Solving the equation and find the value of X. Show your steps.", "instances": [{"input": "10X + 5 = 10", "output": "10X = 5\nX = 0.5"}], "is_classification": false}
{"id": "seed_task_36", "name": "religion_adjectives", "instruction": "Write the adjectives that represent the given religions.", "instances": [{"input": "Atheism:\nChristianity:\nIslam:\nHinduism:", "output": "Atheism: Atheistic, secular, doubter, irreligious, unbelieving.\nChristianity: spiritual, devout, pious, God-fearing.\nIslam: spiritual, devout, pious, God-fearing.\nHinduism: spiritual, devout, pious, God-fearing."}], "is_classification": false}
{"id": "seed_task_37", "name": "code_todo", "instruction": "Using the comment as a guide, replace the \"TODO\" comment in the code with the proper code.", "instances": [{"input": "def greet_to(name):\n    \"\"\"Print a greeting to the name provided.\"\"\"\n    # TODO: assert name is a string\n    print(\"Hello, \" + name + \"!\")", "output": "def greet_to(name):\n    \"\"\"Print a greeting to the name provided.\"\"\"\n    # assert name is a string\n    assert isinstance(name, str), \"name must be a string\"\n    print(\"Hello, \" + name + \"!\")"}], "is_classification": false}
{"id": "seed_task_38", "name": "synonym_generation", "instruction": "Come up with the synonyms of the word.", "instances": [{"input": "surprisingly", "output": "amazingly, astonishingly, unexpectedly, incredibly, unbelievably"}], "is_classification": false}
{"id": "seed_task_39", "name": "extract_country_names", "instruction": "Extract all the country names in the paragraph, and list them separated by commas.", "instances": [{"input": "Dr. No is the sixth novel by the English author Ian Fleming to feature his British Secret Service agent James Bond. Written at Fleming's Goldeneye estate in Jamaica, it was first published in the United Kingdom by Jonathan Cape in 1958. In the novel Bond looks into the disappearance in Jamaica of two fellow MI6 operatives who had been investigating Doctor No. Bond travels to No's Caribbean island and meets Honeychile Rider, who is there to collect shells. They are captured and taken to a luxurious facility carved into a mountain. The character of Doctor No, the son of a German missionary and a Chinese woman, was influenced by Sax Rohmer's Fu Manchu stories. Dr. No was the first of Fleming's novels to face widespread negative reviews in Britain, but it was received more favourably in the United States. The story was adapted in 1962 as the first film in the Bond series, with Sean Connery in the lead role.", "output": "English, British, Jamaica, the United Kingdom, German, Chinese, Britain, the United States."}], "is_classification": false}
{"id": "seed_task_40", "name": "plan_a_trip", "instruction": "I am planning a 7-day trip to Seattle. Can you make a detailed plan for me?", "instances": [{"input": "", "output": "Day 1: Arrive in Seattle. Check into your hotel and explore the city.\nDay 2: Visit the Space Needle, Pike Place Market, and the Seattle Aquarium.\nDay 3: Take a ferry to Bainbridge Island and explore the island.\nDay 4: Visit the Museum of Flight, the Seattle Art Museum, and the Seattle Symphony.\nDay 5: Take a day trip to Mount Rainier National Park.\nDay 6: Visit the Woodland Park Zoo, the Hiram M. Chittenden Locks, and the Seattle Japanese Garden.\nDay 7: Depart Seattle."}], "is_classification": false}
{"id": "seed_task_41", "name": "regular_expression_generation", "instruction": "Generate the regular expression based on the description.", "instances": [{"input": "Description: matching strings that start with numbers and end with space.", "output": "^[0-9].*\\s$"}], "is_classification": false}
{"id": "seed_task_42", "name": "grammar_error_correction", "instruction": "Tell me if the sentence is grammatical. If it is, output yes. Otherwise, correct the grammar errors.", "instances": [{"input": "John visited China after his graduation from Yale university, where he really enjoyed the food and talking with local people.", "output": "No, it is not grammatical. Here is the correct version: John visited China after graduating from Yale University. He really enjoyed the food and talking with locals."}], "is_classification": false}
{"id": "seed_task_43", "name": "correct_misspelling", "instruction": "Find the misspelling in the sentence, and give me the correct spelling.", "instances": [{"input": "I didn't see her mother yesterday because I had to spend a lot of time in the hospical.", "output": "hospical -> hospital."}], "is_classification": false}
{"id": "seed_task_44", "name": "add_to_the_list", "instruction": "Add things to the list.", "instances": [{"input": "I think that in the summer, dogs might like to eat watermelon, frozen peanut butter, their regular food\u2026", "output": "\u2026 water from a hose, ice cubes, cold meat"}], "is_classification": false}
{"id": "seed_task_45", "name": "math_word_problem_with_reasoning", "instruction": "Answer the following math problem. You should write down the reasoning steps.", "instances": [{"input": "Roger has 5 tennis balls. He buys 2 more cans of tennis balls. Each can has 3 tennis balls. How many tennis balls does he have now?", "output": "Roger started with 5 balls. 2 cans of 3 tennis balls each is 6 tennis balls. 5 + 6 = 11. The answer is 11."}], "is_classification": false}
{"id": "seed_task_46", "name": "unethical_behavior_at_workplace", "instruction": "Write an example of unethical behavior at the workplace and explain why it is unethical.", "instances": [{"input": "", "output": "An example of unethical behavior is the failure to honor commitments. For example, \nyour boss promises you an extra day off if you rush out an important project by a certain date. You work late hours and finish the project before the deadline. Ready for your day off, you mention it to your boss who responds \"No, we have too much work to do.\" Your boss engaged in unethical behavior that has virtually guaranteed your future distrust and unwillingness to extend yourself to assist in department emergencies. In addition, you are likely to complain to your co-workers, causing them to distrust the promises of the boss and be unwilling to cooperate with his requests."}], "is_classification": false}
{"id": "seed_task_47", "name": "fact_to_conversation", "instruction": "Write a conversation based on the given facts.", "instances": [{"input": "Facts: \nTemp: 10 C, pm2.5: 27, Loc: Beijing, Roles: User and Bot", "output": "User: Hi there! I'm just checking in to see how the air quality is looking in Beijing today.\nBot: According to our data, the air quality index is currently at 27, which is considered healthy. The temperature is 10 degrees Celsius."}], "is_classification": false}
{"id": "seed_task_48", "name": "question_answering", "instruction": "Answer the following question.", "instances": [{"input": "When did US land human on the moon?", "output": "July 20, 1969."}], "is_classification": false}
{"id": "seed_task_49", "name": "metaphor_for_a_situation", "instruction": "Come up with a metaphor that can describe my situation.", "instances": [{"input": "Situation: I fell in love with her.", "output": "I am like a moth to a flame; I can't help but be drawn to her."}], "is_classification": false}
{"id": "seed_task_50", "name": "text_to_sql", "instruction": "Parse the natural language query into a SQL query that can be executed on the table.", "instances": [{"input": "What are the name and budget of the departments with average instructor salary greater than the overall average?\nGiven two tables: instructor, department.", "output": "SELECT d.name, d.budget FROM instructor i JOIN department d ON i.dept_name = d.name GROUP BY i.dept_name HAVING AVG(i.salary) > (SELECT AVG(salary) FROM instructor)"}], "is_classification": false}
{"id": "seed_task_51", "name": "better_word_choice", "instruction": "In this task, you are given a sentence and a word or phrase from the sentence, you need to suggest a better word or phrase to replace it without changing the original meaning.", "instances": [{"input": "Sentence: I am very happy to be part of the team.\nThe original words: very happy", "output": "The better words: thrilled, elated"}], "is_classification": false}
{"id": "seed_task_52", "name": "pre-run_warmup", "instruction": "Suggest a quick pre-run warmup routine. Explain each move briefly.", "instances": [{"input": "", "output": "1. Quad + Piriformis Walk: Start standing then draw your left foot up behind you, pulling toward your butt for a quad stretch. Release and step forward; switch legs. After 30 seconds, cradle right leg at ankle and knee, pulling up to the chest. Release and step forward; switch legs. Repeat for 30 seconds.\n2. Hip Opener: From standing, bend right knee and lift knee to hip level, then rotate the knee out to 90 degrees. (Place hand over knee to stabilize and guide if needed.) Bring leg back to front; lower foot, and switch sides. Repeat for 30 seconds.\n3. Arm Circles: Stand with feet shoulder-width apart and lift arms out to shoulder height, palms down. Make small circles; after 30 seconds, switch direction. Continue for another 30 seconds.\n4. Frankenstein Walk: Start standing with your feet together. Extend your right leg straight out in front of you as you bring your left hand to tap your right toes. Lower leg and step forward; repeat on the opposite side. Continue for 30 seconds.\n5. Leg Crossover + Scorpion: Lie faceup, legs straight and arms out. Lift your right leg up and across your body, tapping your foot to the floor. Return to start; repeat on other side. After 30 seconds, flip over to lie facedown and perform a Scorpion. Lie facedown. Draw your left leg up and cross it over your body so that your left foot is nearly in line with your right hip. Hold for a breath or two, then return to start. Repeat on the other side and continue to alternate for 30 seconds.\n6. Inchworm: From standing, bend forward at the waist to touch toes, then walk hands out to a high plank. Hold for 2 seconds; walk feet to meet hands. Roll up to starting position. Repeat for 1 minute."}], "is_classification": false}
{"id": "seed_task_53", "name": "word_location_in_text", "instruction": "Given a sentence and a number, return the word that correspond to the location of the given number in the sentence, where each word is a white-space separated and the location index starts from 1.", "instances": [{"input": "This is a random sentence. 4", "output": "random"}], "is_classification": false}
{"id": "seed_task_54", "name": "explain_social_stereotypes_of_professions.", "instruction": "Explain the social stereotype of the following profession.", "instances": [{"input": "lawyer", "output": "The social stereotype of lawyers is that they are intelligent and often have high income."}], "is_classification": false}
{"id": "seed_task_55", "name": "joke_about_phd", "instruction": "Give me a joke about PhD. Try to make a long joke.", "instances": [{"input": "", "output": "Why God never got a PhD?\n1. He had only one major publication.\n2. It was written in Aramaic, not in English.\n3. It has no references.\n4. It wasn't even published in a refereed journal.\n5. There are serious doubts he wrote it himself.\n6. It may be true that he created the world, but what has he done since\nthen?\n7. His cooperative efforts have been quite limited."}], "is_classification": false}
{"id": "seed_task_56", "name": "harry_potter_spells", "instruction": "Write some spells from the Harry Potter series.", "instances": [{"input": "", "output": "1. Accio: This spell summons an object to the caster's hand. \n2. Alohomora: This spell opens locked doors. \n3. Avada Kedavra: This spell kills the target. \n4. Expelliarmus: This spell disarms an opponent. \n5. Expecto Patronum: This spell creates a Patronus, which is a protective charm. \n6. Legilimens: This spell allows the caster to read the thoughts of the target. \n7. Lumos: This spell lights up the caster's wand tip."}], "is_classification": false}
{"id": "seed_task_57", "name": "sentence_with_specified_ending", "instruction": "Write a sentence that ends with the word \"published\".", "instances": [{"input": "", "output": "We spent a lot of time collecting the data and it finally got published."}], "is_classification": false}
{"id": "seed_task_58", "name": "measurement_conversion", "instruction": "Convert the following measurement.", "instances": [{"input": "3 ft = _____ in.", "output": "3 ft = 36 in."}], "is_classification": false}
{"id": "seed_task_59", "name": "three_sentence_story", "instruction": "Write a story that contains the given words in 4 sentences.", "instances": [{"input": "universe, whale, killer.", "output": "There once was a killer whale who ruled the universe. All the other animals were scared of him, except for one little bird. The bird would sit on the whale's head and sing to him. The whale liked the bird's singing so much that he decided not to eat her."}], "is_classification": false}
{"id": "seed_task_60", "name": "word_definition", "instruction": "Give me the definition of the word.", "instances": [{"input": "Word: scarce.", "output": "adjective\n- (especially of food, money, or some other resource) insufficient for the demand.\n- occurring in small numbers or quantities; rare."}], "is_classification": false}
{"id": "seed_task_61", "name": "htlm_task", "instruction": "Fill in the  in the html document with a propoer text. You should take the html tag into consideration. You output should be the completed html document.", "instances": [{"input": "\n\n\n\n\u02dc South Korea on Monday announced sweeping\ntax reforms, including income and\ncorporate tax cuts to boost growth by\nstimulating sluggish private\nconsumption and business investment.\n\n", "output": "\n\n\n\n\u02dc South Korea on Monday announced sweeping\ntax reforms, including income and\ncorporate tax cuts to boost growth by\nstimulating sluggish private\nconsumption and business investment.\n\n"}], "is_classification": false}
{"id": "seed_task_62", "name": "summarizing_news", "instruction": "Summarize the following document with several bullet points.", "instances": [{"input": "WASHINGTON (CNN) -- A wide-open presidential race and a willingness by candidates, interest groups,