frankwong2001 commited on
Commit
da18792
·
verified ·
1 Parent(s): d6f426e

Add new SentenceTransformer model

Browse files
1_Pooling/config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "word_embedding_dimension": 768,
3
+ "pooling_mode_cls_token": false,
4
+ "pooling_mode_mean_tokens": true,
5
+ "pooling_mode_max_tokens": false,
6
+ "pooling_mode_mean_sqrt_len_tokens": false,
7
+ "pooling_mode_weightedmean_tokens": false,
8
+ "pooling_mode_lasttoken": false,
9
+ "include_prompt": true
10
+ }
README.md ADDED
@@ -0,0 +1,666 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - sentence-transformers
4
+ - sentence-similarity
5
+ - feature-extraction
6
+ - dense
7
+ - generated_from_trainer
8
+ - dataset_size:3016
9
+ - loss:MultipleNegativesRankingLoss
10
+ base_model: nomic-ai/modernbert-embed-base
11
+ widget:
12
+ - source_sentence: The Keeper/Aquarist is responsible for the care and management
13
+ of wildlife within the organisation/attractions sites. This includes the preparation
14
+ and feeding of food to the wildlife, caring for ill animals, checking enclosures
15
+ and cages for signs of wear or damage for animal, staff and visitor safety, and
16
+ giving educational talks/tours to the visitors. He/she also maintains animal training
17
+ behaviours and promotes conservation awareness through his animal presentations.
18
+ Detail-oriented with a strong passion for animals, he is attentive to the needs
19
+ of the wildlife under his care, and leverages his strong communication skills
20
+ to communicate effectively with visitors on the characteristics and behaviours
21
+ of the wildlife. He is physically fit and works in a shift system encompassing
22
+ weekends and public holidays. Outside the working hours, he may be on a rota for
23
+ call-outs. He often stays outdoors for long periods of time even through inclement
24
+ weather, and may need a driving licence if he is working in a large zoo or safari
25
+ park. He may also need a scuba-diving licence if working in an aquarium.
26
+ sentences:
27
+ - The Director of Nursing in the Education sector collaborates closely with the
28
+ Chief Nurse to develop a comprehensive nursing education framework that addresses
29
+ the diverse learning requirements of nursing students and practicing nurses. This
30
+ role involves identifying emerging competencies and partnering with critical stakeholders
31
+ to promote adaptable and responsive skill acquisition that enhances the nursing
32
+ workforce's capabilities. The Director champions nursing practice through exceptional
33
+ ongoing professional development initiatives tailored to meet the changing demands
34
+ of the national healthcare landscape. Additionally, they secure organizational
35
+ resources to support nurses in their lifelong learning journeys and encourage
36
+ the integration of cutting-edge technologies and innovations in nursing education.
37
+ This position operates across various environments, including acute care, primary
38
+ care, and community care, requiring a visionary leader who is dynamic and attuned
39
+ to the evolving healthcare needs, with strong leadership abilities in fostering
40
+ psychological resilience and creating effective learning environments.
41
+ - 'The Keeper/Aquarist is responsible for managing the administrative tasks related
42
+ to wildlife licensing and permits within the organization. This includes ensuring
43
+ compliance with local regulations, filing necessary documents, and maintaining
44
+ records, without direct interaction with the animals or visitors.
45
+
46
+
47
+ The Keeper/Aquarist oversees a team of junior staff members and is focused on
48
+ strategic planning for conservation policies rather than hands-on animal care.
49
+ This role requires several years of experience in management and decision-making
50
+ authority regarding budget allocations and resource management.
51
+
52
+
53
+ The Keeper/Aquarist works as a Compliance Associate in the banking sector, where
54
+ he/she analyzes regulatory frameworks and ensures adherence to financial policies.
55
+ This position requires a strong understanding of financial compliance rather than
56
+ animal welfare.
57
+
58
+
59
+ The Keeper/Aquarist is tasked with wildlife management in an international context,
60
+ focusing on cross-border regulatory issues and global wildlife trade laws, without
61
+ direct involvement in animal care or public education.
62
+
63
+
64
+ The Keeper/Aquarist combines duties of an event coordinator and animal caretaker,
65
+ responsible for planning wildlife-related events while also providing care for
66
+ the animals, leading to confusion in priorities and skill requirements across
67
+ both roles.'
68
+ - The Keeper/Aquarist is entrusted with the comprehensive care and management of
69
+ wildlife within the organization’s attractions. This role involves preparing and
70
+ providing nutritious food for the animals, tending to those that are unwell, and
71
+ inspecting enclosures for any signs of damage to ensure the safety of animals,
72
+ staff, and visitors. The Keeper/Aquarist also engages guests through educational
73
+ talks and tours, enhancing their understanding of wildlife conservation. With
74
+ a keen eye for detail and a deep passion for animals, he/she is dedicated to meeting
75
+ the needs of the creatures in their care and utilizes strong communication skills
76
+ to effectively share insights about animal behavior and characteristics with visitors.
77
+ The position requires physical fitness, as it involves working shifts that include
78
+ weekends and public holidays. Additionally, the Keeper/Aquarist may be on call
79
+ outside regular hours and is accustomed to spending extended periods outdoors,
80
+ regardless of weather conditions. A driving license is essential for those working
81
+ in larger facilities, and a scuba-diving license may be required for aquarium
82
+ settings.
83
+ - source_sentence: The Executive Sous Chef is responsible for managing kitchen operations
84
+ by running the pass and informing cooks of the orders, monitoring speed and rhythm
85
+ of coursing and overseeing plating of dishes throughout. He/She reviews proposed
86
+ initiatives for continuous improvement and monitors the adherence to customer
87
+ services standards. He outlines the organisations service, food hygiene, health
88
+ and safety standards. Resourceful and detail-oriented, he is able to serve as
89
+ a mentor who directs subordinates during kitchen operations. He possesses a service
90
+ mindset and guides his teams to anticipate customer needs. He is expected to work
91
+ long hours and handle the pressure in a fast-paced kitchen environment.
92
+ sentences:
93
+ - The Executive Sous Chef oversees kitchen operations by coordinating order flow
94
+ and guiding cooks on dish preparation, ensuring timely service and presentation.
95
+ He/She evaluates new initiatives for ongoing enhancement while ensuring compliance
96
+ with customer service standards. He establishes the organization’s food safety,
97
+ hygiene, and health protocols. With a resourceful and meticulous approach, he
98
+ mentors team members during kitchen activities, fostering a service-oriented atmosphere
99
+ that anticipates customer preferences. The role demands resilience and the ability
100
+ to thrive in a high-pressure culinary environment.
101
+ - 'The Executive Sous Chef is tasked with managing front-of-house operations by
102
+ coordinating dining room service and overseeing waitstaff during meal service,
103
+ ensuring a smooth dining experience for guests.
104
+
105
+
106
+ The Executive Sous Chef leads a team of junior chefs, focusing on daily kitchen
107
+ tasks without significant decision-making authority, and reports to the Head Chef
108
+ for all operational directives.
109
+
110
+
111
+ The Executive Sous Chef functions as a Marketing Associate in the hospitality
112
+ industry, utilizing analytical skills to develop promotional materials and campaigns,
113
+ while ensuring compliance with brand standards.
114
+
115
+
116
+ The Executive Sous Chef is responsible for managing kitchen operations in a remote
117
+ island resort, adapting to unique local regulations and culinary practices that
118
+ differ significantly from urban environments.
119
+
120
+
121
+ The Executive Sous Chef combines responsibilities from both the culinary and accounting
122
+ departments, overseeing kitchen inventory management while also preparing financial
123
+ reports and budget forecasts inaccurately.'
124
+ - The Lead Maintenance Engineer oversees reliability-focused maintenance initiatives
125
+ to guarantee the ongoing airworthiness of the aircraft fleet. He/She provides
126
+ direction to team members involved in aircraft maintenance operations and spearheads
127
+ asset performance evaluation. He manages intricate projects and formulates maintenance
128
+ strategies based on pertinent technical data, original equipment manufacturer
129
+ (OEM) guidelines, and regulatory standards. As a recognized authority in the field,
130
+ he is responsible for executing work instructions, ensuring quality control, and
131
+ enhancing workflow efficiencies to boost the organization's productivity. He actively
132
+ engages in technical and program assessments, reviews documentation, and ensures
133
+ adherence to engineering policies and procedures set forth by the organization,
134
+ clients, and regulatory bodies. He assesses compliance with airworthiness and
135
+ legislative standards while suggesting improvements to the organization's standard
136
+ operating procedures (SOPs) and safety, health, and quality systems. He plays
137
+ a proactive role in advancing lean and sustainable practices and conducts research
138
+ and innovation in specific areas for ongoing process enhancements. He evaluates
139
+ staff performance and provides coaching and mentoring to technical team members.
140
+ He should demonstrate strong decision-making, resource management, and project
141
+ management capabilities, as well as effective problem-solving, communication,
142
+ and stakeholder engagement skills to address unexpected challenges in fleet management.
143
+ - source_sentence: The Mechanical Engineer/Electrical Engineer manages the planning
144
+ and development of projects. He/She develops mechanical and/or electrical engineering
145
+ designs based on project requirements, from conceptual to schematic and detailed
146
+ designs. He is responsible for designing mechanical and electrical systems. He
147
+ conducts project assessments and is able to provide feasible and creative solutions
148
+ based on the assessment results. He participates in the tendering process and
149
+ assists with the projects' costs and budgets. He plans the team's manpower and
150
+ provides on-the-job coaching to junior staff. He is meticulous, highly detail-oriented
151
+ and has a keen interest to incorporate new technologies into engineering design
152
+ projects. He possesses excellent knowledge in mechanical and/or electrical engineering
153
+ fields, is analytical and has good problem-solving skills. He also possesses strong
154
+ interpersonal and project coordination skills crucial for engagement with internal
155
+ and external stakeholders. He is required to work both in office and at project
156
+ sites.
157
+ sentences:
158
+ - 'The Mechanical Designer oversees the aesthetic aspects of product design, focusing
159
+ on visual appeal rather than engineering functionality.
160
+
161
+
162
+ The Senior Electrical Engineer is responsible for high-level decision-making and
163
+ strategic planning without direct involvement in project execution or team management.
164
+
165
+
166
+ The Compliance Engineer in the pharmaceutical industry ensures that products meet
167
+ regulatory standards, utilizing similar analytical skills but in a completely
168
+ different context.
169
+
170
+
171
+ The Electrical Engineer in the European market adapts designs to comply with specific
172
+ EU regulations, requiring knowledge of different compliance standards and practices.
173
+
174
+
175
+ The Mechanical Engineer/Software Developer hybrid role combines hardware design
176
+ with software coding, leading to confusion between engineering responsibilities
177
+ and programming tasks.'
178
+ - The Barista Supervisor oversees the crafting of beverages in accordance with the
179
+ organization's established recipes and protocols. This role involves preparing
180
+ and suggesting unique, customized drinks to enhance customer satisfaction. To
181
+ ensure exceptional customer service, he/she regularly evaluates compliance with
182
+ service standards. The supervisor also assists in daily operations by organizing
183
+ staff schedules and initiating continuous improvement activities. Additionally,
184
+ he/she ensures adherence to service, food safety, and health regulations. With
185
+ a friendly demeanor and a service-oriented mindset, the supervisor effectively
186
+ manages various tasks while maintaining composure and confidence in interactions
187
+ with a diverse clientele. Flexibility in scheduling is essential, including availability
188
+ for weekends, late nights, and public holidays, along with the endurance to remain
189
+ on their feet for extended periods.
190
+ - The Mechanical Engineer/Electrical Engineer oversees the strategic planning and
191
+ execution of engineering projects. He/She creates innovative mechanical and/or
192
+ electrical designs tailored to project specifications, progressing from initial
193
+ concepts to fully detailed designs. His/her role includes the design of mechanical
194
+ and electrical systems, conducting thorough project evaluations, and delivering
195
+ practical and inventive solutions based on these evaluations. He/She plays an
196
+ active role in the bidding process and supports budget management for projects.
197
+ Additionally, he/she organizes team resources and provides mentorship to junior
198
+ engineers. With a meticulous approach and a passion for integrating cutting-edge
199
+ technologies into engineering designs, he/she demonstrates exceptional expertise
200
+ in mechanical and/or electrical engineering disciplines, strong analytical skills,
201
+ and proficient problem-solving abilities. He/She also possesses excellent interpersonal
202
+ and project management skills essential for effective collaboration with both
203
+ internal teams and external partners. The role requires working in both office
204
+ settings and on-site at project locations.
205
+ - source_sentence: The Assistant Coordination and Reservations Executive assists in
206
+ processing reservations of travel, including air tickets, hotels and attractions
207
+ and issues reservation slips for group reservations. He/She also processes refund
208
+ requests in cases of partially-utilised tickets and knows the airline terminology,
209
+ codes, fare basis, aviation rules and tariffs. Service-oriented with strong multi-tasking
210
+ skills, he liaises with suppliers and customer support department to coordinate
211
+ any changes to reservations. He is also able to perform in a fast paced environment
212
+ and perform checks on the availability of products and services with vendors and
213
+ holds reservations. He assists in the coordination of travel operations including
214
+ arranging of tickets to attractions, coaches, meals and hotel rooms allocation.
215
+ He may be required to work on weekends, evenings, and public holidays in an office
216
+ environment.
217
+ sentences:
218
+ - "The Assistant Coordination and Reservations Executive oversees the management\
219
+ \ of corporate travel budgets, including the negotiation of contracts with travel\
220
+ \ vendors. \n\nThe Assistant Coordination and Reservations Executive leads a team\
221
+ \ of travel consultants, focusing on strategic planning and long-term travel program\
222
+ \ development.\n\nThe Assistant Coordination and Reservations Executive is responsible\
223
+ \ for compliance audits in the healthcare industry, ensuring adherence to regulatory\
224
+ \ standards and internal policies.\n\nThe Assistant Coordination and Reservations\
225
+ \ Executive manages international travel logistics for corporate clients, navigating\
226
+ \ complex global regulations and cross-border travel requirements.\n\nThe Assistant\
227
+ \ Coordination and Reservations Executive combines event planning and travel coordination,\
228
+ \ tasked with organizing large-scale conferences while also handling personal\
229
+ \ travel arrangements."
230
+ - The Automation Technician is responsible for the operation and maintenance of
231
+ automation equipment and systems utilized in stage productions, working under
232
+ the guidance of senior team members. This role involves setting parameters for
233
+ automated stage elements and collaborating with various stakeholders to optimize
234
+ programming and make necessary adjustments for precise movements and placements
235
+ that align with design intentions. During live performances, the technician will
236
+ manage automation systems in accordance with stage cues to ensure operational
237
+ safety and fluidity. Additionally, the Automation Technician plays a key role
238
+ in tracking maintenance, troubleshooting issues, and repairing equipment as needed.
239
+ This position can be offered on a full-time or casual basis in venues, rental
240
+ firms, production companies, or directly for productions.
241
+ - The Assistant Coordination and Reservations Executive plays a crucial role in
242
+ managing travel bookings, encompassing airfares, accommodations, and attractions.
243
+ This position involves generating reservation confirmations for group travel and
244
+ processing refund applications for partially used tickets. The ideal candidate
245
+ is knowledgeable about airline jargon, fare structures, aviation regulations,
246
+ and pricing. With a focus on customer service and exceptional multitasking capabilities,
247
+ the executive collaborates with suppliers and the customer support team to adjust
248
+ reservations as needed. Capable of thriving in a dynamic environment, he/she checks
249
+ product and service availability with vendors and manages bookings efficiently.
250
+ The role includes organizing travel logistics such as tickets for attractions,
251
+ transportation, dining, and hotel assignments, with the expectation of availability
252
+ to work weekends, evenings, and public holidays in an office setting.
253
+ - source_sentence: The Principal Security Consultant is responsible for leading a
254
+ team to clinch consultancy projects to provide security audits, reviews and security
255
+ risk assessment services to clients and recommend improvements to existing security
256
+ measures. He/She is required to evaluate tender documents and manage the deployment
257
+ of security consultants to develop security protection and implementation plans
258
+ for various types of facilities. He is required to work in an office environment
259
+ and perform site visits when necessary. He is expected to communicate with relevant
260
+ stakeholders and clients as part of his role in performing the respective duties.
261
+ This requires him to be analytical, responsive, decisive and cooperative.
262
+ sentences:
263
+ - The Senior Principal Occupational Therapy Educator is responsible for guiding
264
+ and evaluating various training initiatives and programs within the department.
265
+ This role involves providing specialized training to occupational therapists and
266
+ spearheading professional development efforts. The educator develops and executes
267
+ frameworks that enhance learning opportunities throughout the department. The
268
+ position may be situated in diverse environments, including public and private
269
+ sectors, acute care hospitals, rehabilitation facilities, voluntary welfare organizations,
270
+ educational institutions, and client homes. Collaboration with interdisciplinary
271
+ teams, which may comprise educators, healthcare professionals, and therapists,
272
+ is also a key aspect of the role. The ideal candidate should demonstrate visionary
273
+ leadership, innovation, and a strong commitment to the advancement of therapists'
274
+ skills. Effective interpersonal, communication, and team-building abilities are
275
+ essential for success in this position.
276
+ - The Principal Security Consultant is tasked with spearheading a team to secure
277
+ consultancy projects focused on delivering security audits, assessments, and risk
278
+ evaluations to clients, while also suggesting enhancements to current security
279
+ protocols. This role involves reviewing tender documents and overseeing the assignment
280
+ of security specialists to formulate protection strategies and implementation
281
+ plans for diverse facilities. The position primarily operates in an office setting,
282
+ with occasional site visits required. Effective communication with stakeholders
283
+ and clients is essential for fulfilling the responsibilities of this role, demanding
284
+ strong analytical, responsive, decisive, and collaborative skills.
285
+ - "The Principal Security Consultant is responsible for managing a team to oversee\
286
+ \ marketing campaigns, analyzing market trends, and developing promotional strategies\
287
+ \ for various products. \n\nThe Principal Security Consultant oversees the operations\
288
+ \ of a junior audit team, focusing on compliance with internal policies and ensuring\
289
+ \ adherence to basic regulatory requirements.\n\nThe Principal Security Consultant\
290
+ \ is engaged in financial analysis, requiring expertise in budget forecasting\
291
+ \ and revenue management within the retail sector.\n\nThe Principal Security Consultant\
292
+ \ is tasked with leading cross-border legal compliance initiatives, requiring\
293
+ \ knowledge of international law and regulations specific to the European market.\n\
294
+ \nThe Principal Security Consultant combines project management and IT support\
295
+ \ roles, focusing on both strategic planning and technical troubleshooting for\
296
+ \ software applications."
297
+ datasets:
298
+ - frankwong2001/ssf-train-valid-full-synthetic-v3
299
+ pipeline_tag: sentence-similarity
300
+ library_name: sentence-transformers
301
+ ---
302
+
303
+ # SentenceTransformer based on nomic-ai/modernbert-embed-base
304
+
305
+ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [nomic-ai/modernbert-embed-base](https://huggingface.co/nomic-ai/modernbert-embed-base) on the [ssf-train-valid-full-synthetic-v3](https://huggingface.co/datasets/frankwong2001/ssf-train-valid-full-synthetic-v3) dataset. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
306
+
307
+ ## Model Details
308
+
309
+ ### Model Description
310
+ - **Model Type:** Sentence Transformer
311
+ - **Base model:** [nomic-ai/modernbert-embed-base](https://huggingface.co/nomic-ai/modernbert-embed-base) <!-- at revision d556a88e332558790b210f7bdbe87da2fa94a8d8 -->
312
+ - **Maximum Sequence Length:** 8192 tokens
313
+ - **Output Dimensionality:** 768 dimensions
314
+ - **Similarity Function:** Cosine Similarity
315
+ - **Training Dataset:**
316
+ - [ssf-train-valid-full-synthetic-v3](https://huggingface.co/datasets/frankwong2001/ssf-train-valid-full-synthetic-v3)
317
+ <!-- - **Language:** Unknown -->
318
+ <!-- - **License:** Unknown -->
319
+
320
+ ### Model Sources
321
+
322
+ - **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
323
+ - **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
324
+ - **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
325
+
326
+ ### Full Model Architecture
327
+
328
+ ```
329
+ SentenceTransformer(
330
+ (0): Transformer({'max_seq_length': 8192, 'do_lower_case': False, 'architecture': 'ModernBertModel'})
331
+ (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
332
+ (2): Normalize()
333
+ )
334
+ ```
335
+
336
+ ## Usage
337
+
338
+ ### Direct Usage (Sentence Transformers)
339
+
340
+ First install the Sentence Transformers library:
341
+
342
+ ```bash
343
+ pip install -U sentence-transformers
344
+ ```
345
+
346
+ Then you can load this model and run inference.
347
+ ```python
348
+ from sentence_transformers import SentenceTransformer
349
+
350
+ # Download from the 🤗 Hub
351
+ model = SentenceTransformer("frankwong2001/3_modernbert-embed-base")
352
+ # Run inference
353
+ sentences = [
354
+ 'The Principal Security Consultant is responsible for leading a team to clinch consultancy projects to provide security audits, reviews and security risk assessment services to clients and recommend improvements to existing security measures. He/She is required to evaluate tender documents and manage the deployment of security consultants to develop security protection and implementation plans for various types of facilities. He is required to work in an office environment and perform site visits when necessary. He is expected to communicate with relevant stakeholders and clients as part of his role in performing the respective duties. This requires him to be analytical, responsive, decisive and cooperative.',
355
+ 'The Principal Security Consultant is tasked with spearheading a team to secure consultancy projects focused on delivering security audits, assessments, and risk evaluations to clients, while also suggesting enhancements to current security protocols. This role involves reviewing tender documents and overseeing the assignment of security specialists to formulate protection strategies and implementation plans for diverse facilities. The position primarily operates in an office setting, with occasional site visits required. Effective communication with stakeholders and clients is essential for fulfilling the responsibilities of this role, demanding strong analytical, responsive, decisive, and collaborative skills.',
356
+ 'The Principal Security Consultant is responsible for managing a team to oversee marketing campaigns, analyzing market trends, and developing promotional strategies for various products. \n\nThe Principal Security Consultant oversees the operations of a junior audit team, focusing on compliance with internal policies and ensuring adherence to basic regulatory requirements.\n\nThe Principal Security Consultant is engaged in financial analysis, requiring expertise in budget forecasting and revenue management within the retail sector.\n\nThe Principal Security Consultant is tasked with leading cross-border legal compliance initiatives, requiring knowledge of international law and regulations specific to the European market.\n\nThe Principal Security Consultant combines project management and IT support roles, focusing on both strategic planning and technical troubleshooting for software applications.',
357
+ ]
358
+ embeddings = model.encode(sentences)
359
+ print(embeddings.shape)
360
+ # [3, 768]
361
+
362
+ # Get the similarity scores for the embeddings
363
+ similarities = model.similarity(embeddings, embeddings)
364
+ print(similarities)
365
+ # tensor([[1.0000, 0.9259, 0.2866],
366
+ # [0.9259, 1.0000, 0.3057],
367
+ # [0.2866, 0.3057, 1.0000]])
368
+ ```
369
+
370
+ <!--
371
+ ### Direct Usage (Transformers)
372
+
373
+ <details><summary>Click to see the direct usage in Transformers</summary>
374
+
375
+ </details>
376
+ -->
377
+
378
+ <!--
379
+ ### Downstream Usage (Sentence Transformers)
380
+
381
+ You can finetune this model on your own dataset.
382
+
383
+ <details><summary>Click to expand</summary>
384
+
385
+ </details>
386
+ -->
387
+
388
+ <!--
389
+ ### Out-of-Scope Use
390
+
391
+ *List how the model may foreseeably be misused and address what users ought not to do with the model.*
392
+ -->
393
+
394
+ <!--
395
+ ## Bias, Risks and Limitations
396
+
397
+ *What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
398
+ -->
399
+
400
+ <!--
401
+ ### Recommendations
402
+
403
+ *What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
404
+ -->
405
+
406
+ ## Training Details
407
+
408
+ ### Training Dataset
409
+
410
+ #### ssf-train-valid-full-synthetic-v3
411
+
412
+ * Dataset: [ssf-train-valid-full-synthetic-v3](https://huggingface.co/datasets/frankwong2001/ssf-train-valid-full-synthetic-v3) at [b816c6b](https://huggingface.co/datasets/frankwong2001/ssf-train-valid-full-synthetic-v3/tree/b816c6b9a13eb26993c3f9be6bf585ff5b3097c4)
413
+ * Size: 3,016 training samples
414
+ * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
415
+ * Approximate statistics based on the first 1000 samples:
416
+ | | anchor | positive | negative |
417
+ |:--------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
418
+ | type | string | string | string |
419
+ | details | <ul><li>min: 66 tokens</li><li>mean: 169.89 tokens</li><li>max: 355 tokens</li></ul> | <ul><li>min: 67 tokens</li><li>mean: 160.73 tokens</li><li>max: 323 tokens</li></ul> | <ul><li>min: 80 tokens</li><li>mean: 296.5 tokens</li><li>max: 1718 tokens</li></ul> |
420
+ * Samples:
421
+ | anchor | positive | negative |
422
+ |:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
423
+ | <code>The Branch Manager is responsible for ensuring the achievement of the branch's financial targets. He/She is responsible for all functions of the branch under his care, such as hiring employees, implementing service initiatives, overseeing the approval of loans and lines of credit, marketing, and assisting with customer relations. He effectively manages team members within his branch, including developing and motivating them to perform and achieve sales targets. The Branch Manager may occasionally be required to work weekends and after hours. He has good organisational skills, is diligent and possesses strong people management capabilities. He is customer focused, has keen problem solving abilities and is able to manage internal and external stakeholders from a variety of backgrounds.</code> | <code>The Branch Manager oversees the successful attainment of the branch's financial objectives. This role encompasses all operational aspects of the branch, including recruiting staff, executing service strategies, managing loan and credit approvals, conducting marketing efforts, and facilitating customer engagement. The Branch Manager adeptly leads team members, fostering their development and driving them towards achieving sales goals. Occasionally, this position may require working on weekends and after hours. The ideal candidate demonstrates excellent organizational skills, a strong work ethic, and possesses exceptional people management abilities. A customer-centric approach, along with proficient problem-solving skills and the ability to engage with diverse internal and external stakeholders, is essential.</code> | <code>The Branch Manager is tasked with ensuring compliance with environmental regulations across various departments. He/She is responsible for all aspects of environmental audits, including recruiting compliance officers, implementing sustainability initiatives, overseeing the approval of waste disposal methods, marketing green practices, and managing community relations. He effectively collaborates with team members to enhance compliance metrics and achieve sustainability targets. The Branch Manager may be required to work in extreme weather conditions. He has strong analytical skills, is detail-oriented, and possesses excellent project management capabilities. He is focused on environmental impact, has sharp analytical abilities, and can manage regulatory stakeholders from different sectors.<br><br>The Branch Manager is accountable for the strategic growth of multiple branches within an international organization. This senior role involves high-level decision-making, including mergers and acqu...</code> |
424
+ | <code>The Head of IT Audit develops the organisation's IT audit framework to manage regulatory and operational risks to safeguard IT assets. He/She defines key objectives and guiding principles for the formulation of IT risk management programs, as well as procedures for documenting and updating policies, standards, guidelines relating to the management of IT assets. He advices on the development of IT audit plans and ensures that audit plans comply with regulatory, operational, security risks and relevant internal auditing standards. He oversees the conduct of audits, respective investigations into non-compliance and risks identified from audits. He overlooks new IT policies, systems and processes necessary for enhancing IT controls and mitigate risks. He consults with and advises senior leaders regarding internal controls and security procedures, prepares activity and progress reports relating to the IT audit function. He also guide team members on procedures, technical problems, prioritie...</code> | <code>The Head of IT Audit is responsible for establishing a comprehensive IT audit framework that effectively manages regulatory and operational risks to protect the organization’s IT assets. This role involves defining essential objectives and principles for developing IT risk management strategies, as well as creating procedures for the documentation and updating of policies, standards, and guidelines related to IT asset management. The individual provides strategic advice on IT audit planning and ensures compliance with relevant regulations, operational standards, and security risks in line with internal auditing practices. Additionally, the Head of IT Audit supervises audit execution, investigates instances of non-compliance, and addresses risks identified during audits. The role includes overseeing the implementation of new IT policies, systems, and processes to strengthen IT controls and mitigate risks. The Head of IT Audit collaborates with senior leadership to enhance internal contr...</code> | <code>The Head of IT Audit is tasked with managing financial audits across various departments to ensure compliance with budgetary constraints and fiscal regulations. <br><br>The Head of IT Audit functions as a Junior Audit Associate, responsible for assisting in the execution of basic audit tasks under close supervision, with limited decision-making authority.<br><br>The Head of IT Audit serves as a Compliance Officer in the healthcare sector, focusing on regulatory compliance and quality assurance in clinical practices rather than IT environments.<br><br>The Head of IT Audit oversees cross-border financial reporting for international subsidiaries, navigating different accounting standards and regulations while ensuring compliance with local market practices.<br><br>The Head of IT Audit combines responsibilities of a Project Manager and a Risk Analyst, overseeing multiple unrelated projects while simultaneously assessing risks across diverse business units without a clear focus on IT auditing.</code> |
425
+ | <code>A Nurse Manager is responsible for planning, coordinating, directing, and evaluating operational activities and resource utilisation in the department. S/He is also responsible for managing nursing manpower operating expenses and budget effectively to provide high quality patient care. S/He oversees at least one unit. S/He oversees the professional and personal development of all staff under her/his charge. Her/His core function is in managerial tasks, but s/he will also perform some clinical, educational and research tasks in the course of her/his day-to-day work. S/He provides guidance to assistant nurse clinicians and below to ensure optimal care is provided to meet desired patient outcomes and experience. S/He operates in a wide variety of settings such as acute care, primary care, community hospitals, integrated care and long-term care facilities. S/He should be resourceful, prudent, tactful and persuasive.</code> | <code>The Nurse Manager is tasked with organizing, supervising, and assessing the operational functions and resource allocation within the department. They are accountable for effectively managing nursing workforce budgets and expenses to ensure the delivery of exemplary patient care. The Nurse Manager oversees at least one nursing unit and is responsible for fostering both the professional and personal growth of all team members under their leadership. While primarily focused on managerial duties, they also engage in clinical, educational, and research activities as part of their daily responsibilities. Additionally, they provide support and direction to assistant nurse clinicians and lower-level staff to guarantee that optimal care is provided, aligning with desired patient outcomes and experiences. The role is carried out in diverse environments, including acute care, primary care, community hospitals, integrated care, and long-term care facilities. Ideal candidates will be resourceful, p...</code> | <code>The Nurse Manager is focused on developing marketing strategies, coordinating promotional activities, and evaluating the effectiveness of advertising campaigns in the department. They are responsible for managing the budget for marketing materials and operational expenses to enhance brand visibility. The Nurse Manager oversees at least one team of marketing specialists and is tasked with driving the professional growth of all personnel involved. While their primary role is in marketing tasks, they will also conduct some sales training and customer research during their daily activities. They provide mentorship to junior marketing associates and below to ensure effective communication is maintained with clients. The position is performed in various sectors, such as retail, hospitality, and entertainment. Candidates should be innovative, analytical, strategic, and persuasive.<br><br>The Nurse Manager is responsible for supervising nursing operations at a senior level, managing large-scale budg...</code> |
426
+ * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
427
+ ```json
428
+ {
429
+ "scale": 20.0,
430
+ "similarity_fct": "cos_sim",
431
+ "gather_across_devices": false
432
+ }
433
+ ```
434
+
435
+ ### Evaluation Dataset
436
+
437
+ #### ssf-train-valid-full-synthetic-v3
438
+
439
+ * Dataset: [ssf-train-valid-full-synthetic-v3](https://huggingface.co/datasets/frankwong2001/ssf-train-valid-full-synthetic-v3) at [b816c6b](https://huggingface.co/datasets/frankwong2001/ssf-train-valid-full-synthetic-v3/tree/b816c6b9a13eb26993c3f9be6bf585ff5b3097c4)
440
+ * Size: 754 evaluation samples
441
+ * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
442
+ * Approximate statistics based on the first 754 samples:
443
+ | | anchor | positive | negative |
444
+ |:--------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:--------------------------------------------------------------------------------------|
445
+ | type | string | string | string |
446
+ | details | <ul><li>min: 58 tokens</li><li>mean: 169.85 tokens</li><li>max: 380 tokens</li></ul> | <ul><li>min: 54 tokens</li><li>mean: 160.26 tokens</li><li>max: 362 tokens</li></ul> | <ul><li>min: 94 tokens</li><li>mean: 304.69 tokens</li><li>max: 1330 tokens</li></ul> |
447
+ * Samples:
448
+ | anchor | positive | negative |
449
+ |:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
450
+ | <code>The Venue Operations Manager is responsible for overseeing the running of venue operations, including the logistics requirements. He/She works closely with event services department to ensure client requirements are fulfilled in compliance to local health and safety standards. He reviews event plans to ensure generation of maximum yield for organisation. Meticulous and resourceful, he possesses excellent problem-solving skills and is able to react quickly to deviations in the project plans. He is able to work in a flexible workweek, including weekends, evenings, and public holidays, and is comfortable working in both an indoor and outdoor environment depending on the nature and requirements of the events.</code> | <code>The Venue Operations Manager is tasked with managing the day-to-day functions of venue operations, including logistical planning and execution. Collaborating closely with the event services team, he/she ensures that client needs are met while adhering to local health and safety regulations. The manager evaluates event proposals to maximize organizational profitability. Detail-oriented and innovative, he/she demonstrates strong problem-solving abilities and can swiftly adjust to changes in project timelines. Flexibility in working hours is essential, as the role may require availability during weekends, evenings, and public holidays, and the manager must be adept at working in various environments, whether indoor or outdoor, based on event specifications.</code> | <code>The Venue Operations Coordinator is involved in managing the administrative tasks of venue bookings, focusing on client communications rather than logistics. <br><br>The Venue Operations Director oversees large-scale venue strategies and has decision-making authority over multiple departments, requiring extensive experience in leadership roles.<br><br>The Venue Operations Specialist is responsible for compliance checks in a manufacturing facility, utilizing similar analytical skills but within a different regulatory framework.<br><br>The Venue Operations Manager in a different country is tasked with adhering to international safety standards while managing events that cater to a diverse clientele across borders.<br><br>The Venue Operations Manager combines responsibilities of a catering manager and a maintenance supervisor, overseeing food services while also handling facility repairs, which creates confusion in role expectations.</code> |
451
+ | <code>The Network Development Principal Engineer provides technical leadership to the network development team and develops detailed project plans for electricity transmission and/or distribution network development and/or the integration of distributed generation sources and energy storage systems with the grid. As the technical expert, he/she reviews project progress reports and investigation findings of site problems encountered to propose follow- up actions. He reviews installation plans for metering equipment and sensors, and leads process improvement initiatives. He leads technical capability development programmes, including on-the-job training and coaching, and formulates the technical training and development plans for the teams. He manages the Permits-to-Work for the team, and establishes Safe System of Work (SSoW) frameworks and practices for his area of work. He proposes emergency technical and recovery activities based on the crisis management framework, and determines the respo...</code> | <code>The Network Development Principal Engineer offers expert technical guidance to the network development team, crafting comprehensive project plans for the enhancement of electricity transmission and distribution networks, as well as the incorporation of distributed energy resources and storage systems into the grid. Acting as the technical authority, he/she evaluates project progress reports and conducts investigations into site issues, recommending appropriate follow-up actions. He/she assesses installation strategies for metering devices and sensors, spearheads initiatives for process enhancements, and directs technical capability-building programs, including hands-on training and mentorship, while devising technical training and development strategies for the team. Additionally, he/she oversees the management of Permits-to-Work for the team, implementing Safe System of Work (SSoW) frameworks and practices specific to the area of responsibility. He/she formulates emergency technical a...</code> | <code>The Network Development Principal Engineer manages the financial operations of the network development team, focusing on budgeting and expense tracking for electricity transmission projects. <br><br>The Network Development Principal Engineer serves as a Junior Engineer who assists with basic tasks under close supervision, requiring minimal experience in network development and no decision-making authority.<br><br>The Network Development Principal Engineer operates as a Compliance Associate in the healthcare sector, ensuring that all regulatory requirements are met while analyzing data for compliance with health regulations.<br><br>The Network Development Principal Engineer focuses on network development projects in a different regulatory environment, specifically in the European market, adapting to varied local standards and practices.<br><br>The Network Development Principal Engineer combines the roles of a project manager and a sales representative, overseeing project timelines while also directly selling n...</code> |
452
+ | <code>The Keeper/Aquarist is responsible for the care and management of wildlife within the organisation/attractions sites. This includes the preparation and feeding of food to the wildlife, caring for ill animals, checking enclosures and cages for signs of wear or damage for animal, staff and visitor safety, and giving educational talks/tours to the visitors. He/she also maintains animal training behaviours and promotes conservation awareness through his animal presentations. Detail-oriented with a strong passion for animals, he is attentive to the needs of the wildlife under his care, and leverages his strong communication skills to communicate effectively with visitors on the characteristics and behaviours of the wildlife. He is physically fit and works in a shift system encompassing weekends and public holidays. Outside the working hours, he may be on a rota for call-outs. He often stays outdoors for long periods of time even through inclement weather, and may need a driving licence if h...</code> | <code>The Keeper/Aquarist is entrusted with the comprehensive care and management of wildlife within the organization’s attractions. This role involves preparing and providing nutritious food for the animals, tending to those that are unwell, and inspecting enclosures for any signs of damage to ensure the safety of animals, staff, and visitors. The Keeper/Aquarist also engages guests through educational talks and tours, enhancing their understanding of wildlife conservation. With a keen eye for detail and a deep passion for animals, he/she is dedicated to meeting the needs of the creatures in their care and utilizes strong communication skills to effectively share insights about animal behavior and characteristics with visitors. The position requires physical fitness, as it involves working shifts that include weekends and public holidays. Additionally, the Keeper/Aquarist may be on call outside regular hours and is accustomed to spending extended periods outdoors, regardless of weather cond...</code> | <code>The Keeper/Aquarist is responsible for managing the administrative tasks related to wildlife licensing and permits within the organization. This includes ensuring compliance with local regulations, filing necessary documents, and maintaining records, without direct interaction with the animals or visitors.<br><br>The Keeper/Aquarist oversees a team of junior staff members and is focused on strategic planning for conservation policies rather than hands-on animal care. This role requires several years of experience in management and decision-making authority regarding budget allocations and resource management.<br><br>The Keeper/Aquarist works as a Compliance Associate in the banking sector, where he/she analyzes regulatory frameworks and ensures adherence to financial policies. This position requires a strong understanding of financial compliance rather than animal welfare.<br><br>The Keeper/Aquarist is tasked with wildlife management in an international context, focusing on cross-border regulatory issue...</code> |
453
+ * Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
454
+ ```json
455
+ {
456
+ "scale": 20.0,
457
+ "similarity_fct": "cos_sim",
458
+ "gather_across_devices": false
459
+ }
460
+ ```
461
+
462
+ ### Training Hyperparameters
463
+ #### Non-Default Hyperparameters
464
+
465
+ - `eval_strategy`: epoch
466
+ - `per_device_train_batch_size`: 32
467
+ - `per_device_eval_batch_size`: 16
468
+ - `gradient_accumulation_steps`: 16
469
+ - `learning_rate`: 2e-05
470
+ - `num_train_epochs`: 5
471
+ - `lr_scheduler_type`: cosine
472
+ - `warmup_ratio`: 0.1
473
+ - `bf16`: True
474
+ - `tf32`: False
475
+ - `load_best_model_at_end`: True
476
+ - `batch_sampler`: no_duplicates
477
+
478
+ #### All Hyperparameters
479
+ <details><summary>Click to expand</summary>
480
+
481
+ - `overwrite_output_dir`: False
482
+ - `do_predict`: False
483
+ - `eval_strategy`: epoch
484
+ - `prediction_loss_only`: True
485
+ - `per_device_train_batch_size`: 32
486
+ - `per_device_eval_batch_size`: 16
487
+ - `per_gpu_train_batch_size`: None
488
+ - `per_gpu_eval_batch_size`: None
489
+ - `gradient_accumulation_steps`: 16
490
+ - `eval_accumulation_steps`: None
491
+ - `torch_empty_cache_steps`: None
492
+ - `learning_rate`: 2e-05
493
+ - `weight_decay`: 0.0
494
+ - `adam_beta1`: 0.9
495
+ - `adam_beta2`: 0.999
496
+ - `adam_epsilon`: 1e-08
497
+ - `max_grad_norm`: 1.0
498
+ - `num_train_epochs`: 5
499
+ - `max_steps`: -1
500
+ - `lr_scheduler_type`: cosine
501
+ - `lr_scheduler_kwargs`: {}
502
+ - `warmup_ratio`: 0.1
503
+ - `warmup_steps`: 0
504
+ - `log_level`: passive
505
+ - `log_level_replica`: warning
506
+ - `log_on_each_node`: True
507
+ - `logging_nan_inf_filter`: True
508
+ - `save_safetensors`: True
509
+ - `save_on_each_node`: False
510
+ - `save_only_model`: False
511
+ - `restore_callback_states_from_checkpoint`: False
512
+ - `no_cuda`: False
513
+ - `use_cpu`: False
514
+ - `use_mps_device`: False
515
+ - `seed`: 42
516
+ - `data_seed`: None
517
+ - `jit_mode_eval`: False
518
+ - `use_ipex`: False
519
+ - `bf16`: True
520
+ - `fp16`: False
521
+ - `fp16_opt_level`: O1
522
+ - `half_precision_backend`: auto
523
+ - `bf16_full_eval`: False
524
+ - `fp16_full_eval`: False
525
+ - `tf32`: False
526
+ - `local_rank`: 0
527
+ - `ddp_backend`: None
528
+ - `tpu_num_cores`: None
529
+ - `tpu_metrics_debug`: False
530
+ - `debug`: []
531
+ - `dataloader_drop_last`: False
532
+ - `dataloader_num_workers`: 0
533
+ - `dataloader_prefetch_factor`: None
534
+ - `past_index`: -1
535
+ - `disable_tqdm`: False
536
+ - `remove_unused_columns`: True
537
+ - `label_names`: None
538
+ - `load_best_model_at_end`: True
539
+ - `ignore_data_skip`: False
540
+ - `fsdp`: []
541
+ - `fsdp_min_num_params`: 0
542
+ - `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
543
+ - `fsdp_transformer_layer_cls_to_wrap`: None
544
+ - `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
545
+ - `deepspeed`: None
546
+ - `label_smoothing_factor`: 0.0
547
+ - `optim`: adamw_torch_fused
548
+ - `optim_args`: None
549
+ - `adafactor`: False
550
+ - `group_by_length`: False
551
+ - `length_column_name`: length
552
+ - `ddp_find_unused_parameters`: None
553
+ - `ddp_bucket_cap_mb`: None
554
+ - `ddp_broadcast_buffers`: False
555
+ - `dataloader_pin_memory`: True
556
+ - `dataloader_persistent_workers`: False
557
+ - `skip_memory_metrics`: True
558
+ - `use_legacy_prediction_loop`: False
559
+ - `push_to_hub`: False
560
+ - `resume_from_checkpoint`: None
561
+ - `hub_model_id`: None
562
+ - `hub_strategy`: every_save
563
+ - `hub_private_repo`: None
564
+ - `hub_always_push`: False
565
+ - `hub_revision`: None
566
+ - `gradient_checkpointing`: False
567
+ - `gradient_checkpointing_kwargs`: None
568
+ - `include_inputs_for_metrics`: False
569
+ - `include_for_metrics`: []
570
+ - `eval_do_concat_batches`: True
571
+ - `fp16_backend`: auto
572
+ - `push_to_hub_model_id`: None
573
+ - `push_to_hub_organization`: None
574
+ - `mp_parameters`:
575
+ - `auto_find_batch_size`: False
576
+ - `full_determinism`: False
577
+ - `torchdynamo`: None
578
+ - `ray_scope`: last
579
+ - `ddp_timeout`: 1800
580
+ - `torch_compile`: False
581
+ - `torch_compile_backend`: None
582
+ - `torch_compile_mode`: None
583
+ - `include_tokens_per_second`: False
584
+ - `include_num_input_tokens_seen`: False
585
+ - `neftune_noise_alpha`: None
586
+ - `optim_target_modules`: None
587
+ - `batch_eval_metrics`: False
588
+ - `eval_on_start`: False
589
+ - `use_liger_kernel`: False
590
+ - `liger_kernel_config`: None
591
+ - `eval_use_gather_object`: False
592
+ - `average_tokens_across_devices`: False
593
+ - `prompts`: None
594
+ - `batch_sampler`: no_duplicates
595
+ - `multi_dataset_batch_sampler`: proportional
596
+ - `router_mapping`: {}
597
+ - `learning_rate_mapping`: {}
598
+
599
+ </details>
600
+
601
+ ### Training Logs
602
+ | Epoch | Step | Training Loss | Validation Loss |
603
+ |:-------:|:------:|:-------------:|:---------------:|
604
+ | 1.0 | 6 | 0.2184 | 0.0124 |
605
+ | 2.0 | 12 | 0.0066 | 0.0010 |
606
+ | 3.0 | 18 | 0.0031 | 0.0006 |
607
+ | 4.0 | 24 | 0.0027 | 0.0006 |
608
+ | **5.0** | **30** | **0.002** | **0.0005** |
609
+
610
+ * The bold row denotes the saved checkpoint.
611
+
612
+ ### Framework Versions
613
+ - Python: 3.12.11
614
+ - Sentence Transformers: 5.1.0
615
+ - Transformers: 4.55.0
616
+ - PyTorch: 2.8.0+cu128
617
+ - Accelerate: 1.10.0
618
+ - Datasets: 4.0.0
619
+ - Tokenizers: 0.21.4
620
+
621
+ ## Citation
622
+
623
+ ### BibTeX
624
+
625
+ #### Sentence Transformers
626
+ ```bibtex
627
+ @inproceedings{reimers-2019-sentence-bert,
628
+ title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
629
+ author = "Reimers, Nils and Gurevych, Iryna",
630
+ booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
631
+ month = "11",
632
+ year = "2019",
633
+ publisher = "Association for Computational Linguistics",
634
+ url = "https://arxiv.org/abs/1908.10084",
635
+ }
636
+ ```
637
+
638
+ #### MultipleNegativesRankingLoss
639
+ ```bibtex
640
+ @misc{henderson2017efficient,
641
+ title={Efficient Natural Language Response Suggestion for Smart Reply},
642
+ author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
643
+ year={2017},
644
+ eprint={1705.00652},
645
+ archivePrefix={arXiv},
646
+ primaryClass={cs.CL}
647
+ }
648
+ ```
649
+
650
+ <!--
651
+ ## Glossary
652
+
653
+ *Clearly define terms in order to be accessible across audiences.*
654
+ -->
655
+
656
+ <!--
657
+ ## Model Card Authors
658
+
659
+ *Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
660
+ -->
661
+
662
+ <!--
663
+ ## Model Card Contact
664
+
665
+ *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
666
+ -->
config.json ADDED
@@ -0,0 +1,45 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "architectures": [
3
+ "ModernBertModel"
4
+ ],
5
+ "attention_bias": false,
6
+ "attention_dropout": 0.0,
7
+ "bos_token_id": 50281,
8
+ "classifier_activation": "gelu",
9
+ "classifier_bias": false,
10
+ "classifier_dropout": 0.0,
11
+ "classifier_pooling": "mean",
12
+ "cls_token_id": 50281,
13
+ "decoder_bias": true,
14
+ "deterministic_flash_attn": false,
15
+ "embedding_dropout": 0.0,
16
+ "eos_token_id": 50282,
17
+ "global_attn_every_n_layers": 3,
18
+ "global_rope_theta": 160000.0,
19
+ "gradient_checkpointing": false,
20
+ "hidden_activation": "gelu",
21
+ "hidden_size": 768,
22
+ "initializer_cutoff_factor": 2.0,
23
+ "initializer_range": 0.02,
24
+ "intermediate_size": 1152,
25
+ "layer_norm_eps": 1e-05,
26
+ "local_attention": 128,
27
+ "local_rope_theta": 10000.0,
28
+ "max_position_embeddings": 8192,
29
+ "mlp_bias": false,
30
+ "mlp_dropout": 0.0,
31
+ "model_type": "modernbert",
32
+ "norm_bias": false,
33
+ "norm_eps": 1e-05,
34
+ "num_attention_heads": 12,
35
+ "num_hidden_layers": 22,
36
+ "pad_token_id": 50283,
37
+ "position_embedding_type": "absolute",
38
+ "repad_logits_with_grad": false,
39
+ "sep_token_id": 50282,
40
+ "sparse_pred_ignore_index": -100,
41
+ "sparse_prediction": false,
42
+ "torch_dtype": "float32",
43
+ "transformers_version": "4.55.0",
44
+ "vocab_size": 50368
45
+ }
config_sentence_transformers.json ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "__version__": {
3
+ "sentence_transformers": "5.1.0",
4
+ "transformers": "4.55.0",
5
+ "pytorch": "2.8.0+cu128"
6
+ },
7
+ "prompts": {
8
+ "query": "",
9
+ "document": ""
10
+ },
11
+ "default_prompt_name": null,
12
+ "similarity_fn_name": "cosine",
13
+ "model_type": "SentenceTransformer"
14
+ }
model.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:12173c7f242e7fb0c7eb4731687d0b8da0a05b9c3af38ec4e67b90cc6699952c
3
+ size 596070136
modules.json ADDED
@@ -0,0 +1,20 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ [
2
+ {
3
+ "idx": 0,
4
+ "name": "0",
5
+ "path": "",
6
+ "type": "sentence_transformers.models.Transformer"
7
+ },
8
+ {
9
+ "idx": 1,
10
+ "name": "1",
11
+ "path": "1_Pooling",
12
+ "type": "sentence_transformers.models.Pooling"
13
+ },
14
+ {
15
+ "idx": 2,
16
+ "name": "2",
17
+ "path": "2_Normalize",
18
+ "type": "sentence_transformers.models.Normalize"
19
+ }
20
+ ]
sentence_bert_config.json ADDED
@@ -0,0 +1,4 @@
 
 
 
 
 
1
+ {
2
+ "max_seq_length": 8192,
3
+ "do_lower_case": false
4
+ }
special_tokens_map.json ADDED
@@ -0,0 +1,37 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "cls_token": {
3
+ "content": "[CLS]",
4
+ "lstrip": false,
5
+ "normalized": false,
6
+ "rstrip": false,
7
+ "single_word": false
8
+ },
9
+ "mask_token": {
10
+ "content": "[MASK]",
11
+ "lstrip": true,
12
+ "normalized": false,
13
+ "rstrip": false,
14
+ "single_word": false
15
+ },
16
+ "pad_token": {
17
+ "content": "[PAD]",
18
+ "lstrip": false,
19
+ "normalized": false,
20
+ "rstrip": false,
21
+ "single_word": false
22
+ },
23
+ "sep_token": {
24
+ "content": "[SEP]",
25
+ "lstrip": false,
26
+ "normalized": false,
27
+ "rstrip": false,
28
+ "single_word": false
29
+ },
30
+ "unk_token": {
31
+ "content": "[UNK]",
32
+ "lstrip": false,
33
+ "normalized": false,
34
+ "rstrip": false,
35
+ "single_word": false
36
+ }
37
+ }
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,945 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "added_tokens_decoder": {
3
+ "0": {
4
+ "content": "|||IP_ADDRESS|||",
5
+ "lstrip": false,
6
+ "normalized": true,
7
+ "rstrip": false,
8
+ "single_word": false,
9
+ "special": false
10
+ },
11
+ "1": {
12
+ "content": "<|padding|>",
13
+ "lstrip": false,
14
+ "normalized": false,
15
+ "rstrip": false,
16
+ "single_word": false,
17
+ "special": true
18
+ },
19
+ "50254": {
20
+ "content": " ",
21
+ "lstrip": false,
22
+ "normalized": true,
23
+ "rstrip": false,
24
+ "single_word": false,
25
+ "special": false
26
+ },
27
+ "50255": {
28
+ "content": " ",
29
+ "lstrip": false,
30
+ "normalized": true,
31
+ "rstrip": false,
32
+ "single_word": false,
33
+ "special": false
34
+ },
35
+ "50256": {
36
+ "content": " ",
37
+ "lstrip": false,
38
+ "normalized": true,
39
+ "rstrip": false,
40
+ "single_word": false,
41
+ "special": false
42
+ },
43
+ "50257": {
44
+ "content": " ",
45
+ "lstrip": false,
46
+ "normalized": true,
47
+ "rstrip": false,
48
+ "single_word": false,
49
+ "special": false
50
+ },
51
+ "50258": {
52
+ "content": " ",
53
+ "lstrip": false,
54
+ "normalized": true,
55
+ "rstrip": false,
56
+ "single_word": false,
57
+ "special": false
58
+ },
59
+ "50259": {
60
+ "content": " ",
61
+ "lstrip": false,
62
+ "normalized": true,
63
+ "rstrip": false,
64
+ "single_word": false,
65
+ "special": false
66
+ },
67
+ "50260": {
68
+ "content": " ",
69
+ "lstrip": false,
70
+ "normalized": true,
71
+ "rstrip": false,
72
+ "single_word": false,
73
+ "special": false
74
+ },
75
+ "50261": {
76
+ "content": " ",
77
+ "lstrip": false,
78
+ "normalized": true,
79
+ "rstrip": false,
80
+ "single_word": false,
81
+ "special": false
82
+ },
83
+ "50262": {
84
+ "content": " ",
85
+ "lstrip": false,
86
+ "normalized": true,
87
+ "rstrip": false,
88
+ "single_word": false,
89
+ "special": false
90
+ },
91
+ "50263": {
92
+ "content": " ",
93
+ "lstrip": false,
94
+ "normalized": true,
95
+ "rstrip": false,
96
+ "single_word": false,
97
+ "special": false
98
+ },
99
+ "50264": {
100
+ "content": " ",
101
+ "lstrip": false,
102
+ "normalized": true,
103
+ "rstrip": false,
104
+ "single_word": false,
105
+ "special": false
106
+ },
107
+ "50265": {
108
+ "content": " ",
109
+ "lstrip": false,
110
+ "normalized": true,
111
+ "rstrip": false,
112
+ "single_word": false,
113
+ "special": false
114
+ },
115
+ "50266": {
116
+ "content": " ",
117
+ "lstrip": false,
118
+ "normalized": true,
119
+ "rstrip": false,
120
+ "single_word": false,
121
+ "special": false
122
+ },
123
+ "50267": {
124
+ "content": " ",
125
+ "lstrip": false,
126
+ "normalized": true,
127
+ "rstrip": false,
128
+ "single_word": false,
129
+ "special": false
130
+ },
131
+ "50268": {
132
+ "content": " ",
133
+ "lstrip": false,
134
+ "normalized": true,
135
+ "rstrip": false,
136
+ "single_word": false,
137
+ "special": false
138
+ },
139
+ "50269": {
140
+ "content": " ",
141
+ "lstrip": false,
142
+ "normalized": true,
143
+ "rstrip": false,
144
+ "single_word": false,
145
+ "special": false
146
+ },
147
+ "50270": {
148
+ "content": " ",
149
+ "lstrip": false,
150
+ "normalized": true,
151
+ "rstrip": false,
152
+ "single_word": false,
153
+ "special": false
154
+ },
155
+ "50271": {
156
+ "content": " ",
157
+ "lstrip": false,
158
+ "normalized": true,
159
+ "rstrip": false,
160
+ "single_word": false,
161
+ "special": false
162
+ },
163
+ "50272": {
164
+ "content": " ",
165
+ "lstrip": false,
166
+ "normalized": true,
167
+ "rstrip": false,
168
+ "single_word": false,
169
+ "special": false
170
+ },
171
+ "50273": {
172
+ "content": " ",
173
+ "lstrip": false,
174
+ "normalized": true,
175
+ "rstrip": false,
176
+ "single_word": false,
177
+ "special": false
178
+ },
179
+ "50274": {
180
+ "content": " ",
181
+ "lstrip": false,
182
+ "normalized": true,
183
+ "rstrip": false,
184
+ "single_word": false,
185
+ "special": false
186
+ },
187
+ "50275": {
188
+ "content": " ",
189
+ "lstrip": false,
190
+ "normalized": true,
191
+ "rstrip": false,
192
+ "single_word": false,
193
+ "special": false
194
+ },
195
+ "50276": {
196
+ "content": " ",
197
+ "lstrip": false,
198
+ "normalized": true,
199
+ "rstrip": false,
200
+ "single_word": false,
201
+ "special": false
202
+ },
203
+ "50277": {
204
+ "content": "|||EMAIL_ADDRESS|||",
205
+ "lstrip": false,
206
+ "normalized": true,
207
+ "rstrip": false,
208
+ "single_word": false,
209
+ "special": false
210
+ },
211
+ "50278": {
212
+ "content": "|||PHONE_NUMBER|||",
213
+ "lstrip": false,
214
+ "normalized": true,
215
+ "rstrip": false,
216
+ "single_word": false,
217
+ "special": false
218
+ },
219
+ "50279": {
220
+ "content": "<|endoftext|>",
221
+ "lstrip": false,
222
+ "normalized": false,
223
+ "rstrip": false,
224
+ "single_word": false,
225
+ "special": true
226
+ },
227
+ "50280": {
228
+ "content": "[UNK]",
229
+ "lstrip": false,
230
+ "normalized": false,
231
+ "rstrip": false,
232
+ "single_word": false,
233
+ "special": true
234
+ },
235
+ "50281": {
236
+ "content": "[CLS]",
237
+ "lstrip": false,
238
+ "normalized": false,
239
+ "rstrip": false,
240
+ "single_word": false,
241
+ "special": true
242
+ },
243
+ "50282": {
244
+ "content": "[SEP]",
245
+ "lstrip": false,
246
+ "normalized": false,
247
+ "rstrip": false,
248
+ "single_word": false,
249
+ "special": true
250
+ },
251
+ "50283": {
252
+ "content": "[PAD]",
253
+ "lstrip": false,
254
+ "normalized": false,
255
+ "rstrip": false,
256
+ "single_word": false,
257
+ "special": true
258
+ },
259
+ "50284": {
260
+ "content": "[MASK]",
261
+ "lstrip": true,
262
+ "normalized": false,
263
+ "rstrip": false,
264
+ "single_word": false,
265
+ "special": true
266
+ },
267
+ "50285": {
268
+ "content": "[unused0]",
269
+ "lstrip": false,
270
+ "normalized": true,
271
+ "rstrip": false,
272
+ "single_word": false,
273
+ "special": false
274
+ },
275
+ "50286": {
276
+ "content": "[unused1]",
277
+ "lstrip": false,
278
+ "normalized": true,
279
+ "rstrip": false,
280
+ "single_word": false,
281
+ "special": false
282
+ },
283
+ "50287": {
284
+ "content": "[unused2]",
285
+ "lstrip": false,
286
+ "normalized": true,
287
+ "rstrip": false,
288
+ "single_word": false,
289
+ "special": false
290
+ },
291
+ "50288": {
292
+ "content": "[unused3]",
293
+ "lstrip": false,
294
+ "normalized": true,
295
+ "rstrip": false,
296
+ "single_word": false,
297
+ "special": false
298
+ },
299
+ "50289": {
300
+ "content": "[unused4]",
301
+ "lstrip": false,
302
+ "normalized": true,
303
+ "rstrip": false,
304
+ "single_word": false,
305
+ "special": false
306
+ },
307
+ "50290": {
308
+ "content": "[unused5]",
309
+ "lstrip": false,
310
+ "normalized": true,
311
+ "rstrip": false,
312
+ "single_word": false,
313
+ "special": false
314
+ },
315
+ "50291": {
316
+ "content": "[unused6]",
317
+ "lstrip": false,
318
+ "normalized": true,
319
+ "rstrip": false,
320
+ "single_word": false,
321
+ "special": false
322
+ },
323
+ "50292": {
324
+ "content": "[unused7]",
325
+ "lstrip": false,
326
+ "normalized": true,
327
+ "rstrip": false,
328
+ "single_word": false,
329
+ "special": false
330
+ },
331
+ "50293": {
332
+ "content": "[unused8]",
333
+ "lstrip": false,
334
+ "normalized": true,
335
+ "rstrip": false,
336
+ "single_word": false,
337
+ "special": false
338
+ },
339
+ "50294": {
340
+ "content": "[unused9]",
341
+ "lstrip": false,
342
+ "normalized": true,
343
+ "rstrip": false,
344
+ "single_word": false,
345
+ "special": false
346
+ },
347
+ "50295": {
348
+ "content": "[unused10]",
349
+ "lstrip": false,
350
+ "normalized": true,
351
+ "rstrip": false,
352
+ "single_word": false,
353
+ "special": false
354
+ },
355
+ "50296": {
356
+ "content": "[unused11]",
357
+ "lstrip": false,
358
+ "normalized": true,
359
+ "rstrip": false,
360
+ "single_word": false,
361
+ "special": false
362
+ },
363
+ "50297": {
364
+ "content": "[unused12]",
365
+ "lstrip": false,
366
+ "normalized": true,
367
+ "rstrip": false,
368
+ "single_word": false,
369
+ "special": false
370
+ },
371
+ "50298": {
372
+ "content": "[unused13]",
373
+ "lstrip": false,
374
+ "normalized": true,
375
+ "rstrip": false,
376
+ "single_word": false,
377
+ "special": false
378
+ },
379
+ "50299": {
380
+ "content": "[unused14]",
381
+ "lstrip": false,
382
+ "normalized": true,
383
+ "rstrip": false,
384
+ "single_word": false,
385
+ "special": false
386
+ },
387
+ "50300": {
388
+ "content": "[unused15]",
389
+ "lstrip": false,
390
+ "normalized": true,
391
+ "rstrip": false,
392
+ "single_word": false,
393
+ "special": false
394
+ },
395
+ "50301": {
396
+ "content": "[unused16]",
397
+ "lstrip": false,
398
+ "normalized": true,
399
+ "rstrip": false,
400
+ "single_word": false,
401
+ "special": false
402
+ },
403
+ "50302": {
404
+ "content": "[unused17]",
405
+ "lstrip": false,
406
+ "normalized": true,
407
+ "rstrip": false,
408
+ "single_word": false,
409
+ "special": false
410
+ },
411
+ "50303": {
412
+ "content": "[unused18]",
413
+ "lstrip": false,
414
+ "normalized": true,
415
+ "rstrip": false,
416
+ "single_word": false,
417
+ "special": false
418
+ },
419
+ "50304": {
420
+ "content": "[unused19]",
421
+ "lstrip": false,
422
+ "normalized": true,
423
+ "rstrip": false,
424
+ "single_word": false,
425
+ "special": false
426
+ },
427
+ "50305": {
428
+ "content": "[unused20]",
429
+ "lstrip": false,
430
+ "normalized": true,
431
+ "rstrip": false,
432
+ "single_word": false,
433
+ "special": false
434
+ },
435
+ "50306": {
436
+ "content": "[unused21]",
437
+ "lstrip": false,
438
+ "normalized": true,
439
+ "rstrip": false,
440
+ "single_word": false,
441
+ "special": false
442
+ },
443
+ "50307": {
444
+ "content": "[unused22]",
445
+ "lstrip": false,
446
+ "normalized": true,
447
+ "rstrip": false,
448
+ "single_word": false,
449
+ "special": false
450
+ },
451
+ "50308": {
452
+ "content": "[unused23]",
453
+ "lstrip": false,
454
+ "normalized": true,
455
+ "rstrip": false,
456
+ "single_word": false,
457
+ "special": false
458
+ },
459
+ "50309": {
460
+ "content": "[unused24]",
461
+ "lstrip": false,
462
+ "normalized": true,
463
+ "rstrip": false,
464
+ "single_word": false,
465
+ "special": false
466
+ },
467
+ "50310": {
468
+ "content": "[unused25]",
469
+ "lstrip": false,
470
+ "normalized": true,
471
+ "rstrip": false,
472
+ "single_word": false,
473
+ "special": false
474
+ },
475
+ "50311": {
476
+ "content": "[unused26]",
477
+ "lstrip": false,
478
+ "normalized": true,
479
+ "rstrip": false,
480
+ "single_word": false,
481
+ "special": false
482
+ },
483
+ "50312": {
484
+ "content": "[unused27]",
485
+ "lstrip": false,
486
+ "normalized": true,
487
+ "rstrip": false,
488
+ "single_word": false,
489
+ "special": false
490
+ },
491
+ "50313": {
492
+ "content": "[unused28]",
493
+ "lstrip": false,
494
+ "normalized": true,
495
+ "rstrip": false,
496
+ "single_word": false,
497
+ "special": false
498
+ },
499
+ "50314": {
500
+ "content": "[unused29]",
501
+ "lstrip": false,
502
+ "normalized": true,
503
+ "rstrip": false,
504
+ "single_word": false,
505
+ "special": false
506
+ },
507
+ "50315": {
508
+ "content": "[unused30]",
509
+ "lstrip": false,
510
+ "normalized": true,
511
+ "rstrip": false,
512
+ "single_word": false,
513
+ "special": false
514
+ },
515
+ "50316": {
516
+ "content": "[unused31]",
517
+ "lstrip": false,
518
+ "normalized": true,
519
+ "rstrip": false,
520
+ "single_word": false,
521
+ "special": false
522
+ },
523
+ "50317": {
524
+ "content": "[unused32]",
525
+ "lstrip": false,
526
+ "normalized": true,
527
+ "rstrip": false,
528
+ "single_word": false,
529
+ "special": false
530
+ },
531
+ "50318": {
532
+ "content": "[unused33]",
533
+ "lstrip": false,
534
+ "normalized": true,
535
+ "rstrip": false,
536
+ "single_word": false,
537
+ "special": false
538
+ },
539
+ "50319": {
540
+ "content": "[unused34]",
541
+ "lstrip": false,
542
+ "normalized": true,
543
+ "rstrip": false,
544
+ "single_word": false,
545
+ "special": false
546
+ },
547
+ "50320": {
548
+ "content": "[unused35]",
549
+ "lstrip": false,
550
+ "normalized": true,
551
+ "rstrip": false,
552
+ "single_word": false,
553
+ "special": false
554
+ },
555
+ "50321": {
556
+ "content": "[unused36]",
557
+ "lstrip": false,
558
+ "normalized": true,
559
+ "rstrip": false,
560
+ "single_word": false,
561
+ "special": false
562
+ },
563
+ "50322": {
564
+ "content": "[unused37]",
565
+ "lstrip": false,
566
+ "normalized": true,
567
+ "rstrip": false,
568
+ "single_word": false,
569
+ "special": false
570
+ },
571
+ "50323": {
572
+ "content": "[unused38]",
573
+ "lstrip": false,
574
+ "normalized": true,
575
+ "rstrip": false,
576
+ "single_word": false,
577
+ "special": false
578
+ },
579
+ "50324": {
580
+ "content": "[unused39]",
581
+ "lstrip": false,
582
+ "normalized": true,
583
+ "rstrip": false,
584
+ "single_word": false,
585
+ "special": false
586
+ },
587
+ "50325": {
588
+ "content": "[unused40]",
589
+ "lstrip": false,
590
+ "normalized": true,
591
+ "rstrip": false,
592
+ "single_word": false,
593
+ "special": false
594
+ },
595
+ "50326": {
596
+ "content": "[unused41]",
597
+ "lstrip": false,
598
+ "normalized": true,
599
+ "rstrip": false,
600
+ "single_word": false,
601
+ "special": false
602
+ },
603
+ "50327": {
604
+ "content": "[unused42]",
605
+ "lstrip": false,
606
+ "normalized": true,
607
+ "rstrip": false,
608
+ "single_word": false,
609
+ "special": false
610
+ },
611
+ "50328": {
612
+ "content": "[unused43]",
613
+ "lstrip": false,
614
+ "normalized": true,
615
+ "rstrip": false,
616
+ "single_word": false,
617
+ "special": false
618
+ },
619
+ "50329": {
620
+ "content": "[unused44]",
621
+ "lstrip": false,
622
+ "normalized": true,
623
+ "rstrip": false,
624
+ "single_word": false,
625
+ "special": false
626
+ },
627
+ "50330": {
628
+ "content": "[unused45]",
629
+ "lstrip": false,
630
+ "normalized": true,
631
+ "rstrip": false,
632
+ "single_word": false,
633
+ "special": false
634
+ },
635
+ "50331": {
636
+ "content": "[unused46]",
637
+ "lstrip": false,
638
+ "normalized": true,
639
+ "rstrip": false,
640
+ "single_word": false,
641
+ "special": false
642
+ },
643
+ "50332": {
644
+ "content": "[unused47]",
645
+ "lstrip": false,
646
+ "normalized": true,
647
+ "rstrip": false,
648
+ "single_word": false,
649
+ "special": false
650
+ },
651
+ "50333": {
652
+ "content": "[unused48]",
653
+ "lstrip": false,
654
+ "normalized": true,
655
+ "rstrip": false,
656
+ "single_word": false,
657
+ "special": false
658
+ },
659
+ "50334": {
660
+ "content": "[unused49]",
661
+ "lstrip": false,
662
+ "normalized": true,
663
+ "rstrip": false,
664
+ "single_word": false,
665
+ "special": false
666
+ },
667
+ "50335": {
668
+ "content": "[unused50]",
669
+ "lstrip": false,
670
+ "normalized": true,
671
+ "rstrip": false,
672
+ "single_word": false,
673
+ "special": false
674
+ },
675
+ "50336": {
676
+ "content": "[unused51]",
677
+ "lstrip": false,
678
+ "normalized": true,
679
+ "rstrip": false,
680
+ "single_word": false,
681
+ "special": false
682
+ },
683
+ "50337": {
684
+ "content": "[unused52]",
685
+ "lstrip": false,
686
+ "normalized": true,
687
+ "rstrip": false,
688
+ "single_word": false,
689
+ "special": false
690
+ },
691
+ "50338": {
692
+ "content": "[unused53]",
693
+ "lstrip": false,
694
+ "normalized": true,
695
+ "rstrip": false,
696
+ "single_word": false,
697
+ "special": false
698
+ },
699
+ "50339": {
700
+ "content": "[unused54]",
701
+ "lstrip": false,
702
+ "normalized": true,
703
+ "rstrip": false,
704
+ "single_word": false,
705
+ "special": false
706
+ },
707
+ "50340": {
708
+ "content": "[unused55]",
709
+ "lstrip": false,
710
+ "normalized": true,
711
+ "rstrip": false,
712
+ "single_word": false,
713
+ "special": false
714
+ },
715
+ "50341": {
716
+ "content": "[unused56]",
717
+ "lstrip": false,
718
+ "normalized": true,
719
+ "rstrip": false,
720
+ "single_word": false,
721
+ "special": false
722
+ },
723
+ "50342": {
724
+ "content": "[unused57]",
725
+ "lstrip": false,
726
+ "normalized": true,
727
+ "rstrip": false,
728
+ "single_word": false,
729
+ "special": false
730
+ },
731
+ "50343": {
732
+ "content": "[unused58]",
733
+ "lstrip": false,
734
+ "normalized": true,
735
+ "rstrip": false,
736
+ "single_word": false,
737
+ "special": false
738
+ },
739
+ "50344": {
740
+ "content": "[unused59]",
741
+ "lstrip": false,
742
+ "normalized": true,
743
+ "rstrip": false,
744
+ "single_word": false,
745
+ "special": false
746
+ },
747
+ "50345": {
748
+ "content": "[unused60]",
749
+ "lstrip": false,
750
+ "normalized": true,
751
+ "rstrip": false,
752
+ "single_word": false,
753
+ "special": false
754
+ },
755
+ "50346": {
756
+ "content": "[unused61]",
757
+ "lstrip": false,
758
+ "normalized": true,
759
+ "rstrip": false,
760
+ "single_word": false,
761
+ "special": false
762
+ },
763
+ "50347": {
764
+ "content": "[unused62]",
765
+ "lstrip": false,
766
+ "normalized": true,
767
+ "rstrip": false,
768
+ "single_word": false,
769
+ "special": false
770
+ },
771
+ "50348": {
772
+ "content": "[unused63]",
773
+ "lstrip": false,
774
+ "normalized": true,
775
+ "rstrip": false,
776
+ "single_word": false,
777
+ "special": false
778
+ },
779
+ "50349": {
780
+ "content": "[unused64]",
781
+ "lstrip": false,
782
+ "normalized": true,
783
+ "rstrip": false,
784
+ "single_word": false,
785
+ "special": false
786
+ },
787
+ "50350": {
788
+ "content": "[unused65]",
789
+ "lstrip": false,
790
+ "normalized": true,
791
+ "rstrip": false,
792
+ "single_word": false,
793
+ "special": false
794
+ },
795
+ "50351": {
796
+ "content": "[unused66]",
797
+ "lstrip": false,
798
+ "normalized": true,
799
+ "rstrip": false,
800
+ "single_word": false,
801
+ "special": false
802
+ },
803
+ "50352": {
804
+ "content": "[unused67]",
805
+ "lstrip": false,
806
+ "normalized": true,
807
+ "rstrip": false,
808
+ "single_word": false,
809
+ "special": false
810
+ },
811
+ "50353": {
812
+ "content": "[unused68]",
813
+ "lstrip": false,
814
+ "normalized": true,
815
+ "rstrip": false,
816
+ "single_word": false,
817
+ "special": false
818
+ },
819
+ "50354": {
820
+ "content": "[unused69]",
821
+ "lstrip": false,
822
+ "normalized": true,
823
+ "rstrip": false,
824
+ "single_word": false,
825
+ "special": false
826
+ },
827
+ "50355": {
828
+ "content": "[unused70]",
829
+ "lstrip": false,
830
+ "normalized": true,
831
+ "rstrip": false,
832
+ "single_word": false,
833
+ "special": false
834
+ },
835
+ "50356": {
836
+ "content": "[unused71]",
837
+ "lstrip": false,
838
+ "normalized": true,
839
+ "rstrip": false,
840
+ "single_word": false,
841
+ "special": false
842
+ },
843
+ "50357": {
844
+ "content": "[unused72]",
845
+ "lstrip": false,
846
+ "normalized": true,
847
+ "rstrip": false,
848
+ "single_word": false,
849
+ "special": false
850
+ },
851
+ "50358": {
852
+ "content": "[unused73]",
853
+ "lstrip": false,
854
+ "normalized": true,
855
+ "rstrip": false,
856
+ "single_word": false,
857
+ "special": false
858
+ },
859
+ "50359": {
860
+ "content": "[unused74]",
861
+ "lstrip": false,
862
+ "normalized": true,
863
+ "rstrip": false,
864
+ "single_word": false,
865
+ "special": false
866
+ },
867
+ "50360": {
868
+ "content": "[unused75]",
869
+ "lstrip": false,
870
+ "normalized": true,
871
+ "rstrip": false,
872
+ "single_word": false,
873
+ "special": false
874
+ },
875
+ "50361": {
876
+ "content": "[unused76]",
877
+ "lstrip": false,
878
+ "normalized": true,
879
+ "rstrip": false,
880
+ "single_word": false,
881
+ "special": false
882
+ },
883
+ "50362": {
884
+ "content": "[unused77]",
885
+ "lstrip": false,
886
+ "normalized": true,
887
+ "rstrip": false,
888
+ "single_word": false,
889
+ "special": false
890
+ },
891
+ "50363": {
892
+ "content": "[unused78]",
893
+ "lstrip": false,
894
+ "normalized": true,
895
+ "rstrip": false,
896
+ "single_word": false,
897
+ "special": false
898
+ },
899
+ "50364": {
900
+ "content": "[unused79]",
901
+ "lstrip": false,
902
+ "normalized": true,
903
+ "rstrip": false,
904
+ "single_word": false,
905
+ "special": false
906
+ },
907
+ "50365": {
908
+ "content": "[unused80]",
909
+ "lstrip": false,
910
+ "normalized": true,
911
+ "rstrip": false,
912
+ "single_word": false,
913
+ "special": false
914
+ },
915
+ "50366": {
916
+ "content": "[unused81]",
917
+ "lstrip": false,
918
+ "normalized": true,
919
+ "rstrip": false,
920
+ "single_word": false,
921
+ "special": false
922
+ },
923
+ "50367": {
924
+ "content": "[unused82]",
925
+ "lstrip": false,
926
+ "normalized": true,
927
+ "rstrip": false,
928
+ "single_word": false,
929
+ "special": false
930
+ }
931
+ },
932
+ "clean_up_tokenization_spaces": true,
933
+ "cls_token": "[CLS]",
934
+ "extra_special_tokens": {},
935
+ "mask_token": "[MASK]",
936
+ "model_input_names": [
937
+ "input_ids",
938
+ "attention_mask"
939
+ ],
940
+ "model_max_length": 8192,
941
+ "pad_token": "[PAD]",
942
+ "sep_token": "[SEP]",
943
+ "tokenizer_class": "PreTrainedTokenizerFast",
944
+ "unk_token": "[UNK]"
945
+ }